PCT US99 23^3 



VTENT COOPERATION TRE, . Y 

From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF ELECTION 

(PCT Rule 61.2) 


Accict^nt C n m m iccinnpr f r P A t P n t ^ 

United States Patent and Trademark 

Office 

Box PCT 

Washington, D.C. 20231 
ETATS-UNIS D AMERIQUE 

r> ! ts capacity as electee 0 4r i.e 


Date of mailing (day month yean 
16 June 2000 (16.06.00) 




International application No. 

PCT US99 23731 


Applicant's or agent's file reference 

235.00010201 


International filing date (day month year) 

12 October 1999 (12.10.99) 


Pnorrty date [day month yean 

13 October 1998 (13.10.98) 


Applicant 

ALTMAN, Elliot 



1. The designated Office is hereby notified of its election made: 



X in the demand filed with the International Preliminary Examining Authority on; 

11 May 2000 (11.05.00) 

[ | m a notice effecting later election filed with the International Bureau on: 



2. The election 



0 
□ 



made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 





A^orzeci o"'ce r 




The International Bureau of WIPO 






34, chemin des Colombettes 




Manu Berrod 


1211 Geneva 20, Switzerland 







Facsim e No. i a 1 -22 ) 740.14 35 



3 3o 83.3c 



-o r m PC T ! B 33 1 fJ ! Ji/ 19921 



. W S9923^ 



PATENT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicants or agenf s fie reference 
235.00010201 
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ACTION 


International application No. 
PCT/US 99/23731 


International filing date (day/month/year) 

12/10/1999 


(Earnest) Priority Date (day/month/year) 

13/10/1998 


Applicant 

THE UNIVERSITY OF GEORGIA RESEARCH ....et al 



This International Search Report has been prepared by this International Searching Authority and te transmitted to the applicant 
according to Article 1 8. A copy Is being transmitted to the International Bureau. 

This International Search Report consists of a total of 5 sheets. 

[X] It Is also accompanied by a copy of each prior art document cited In this report 



1. Baals erf the report 

a. With regard to the language, the International search was carried out on the basis of the International application In the 
language In which It was filed, unless otherwise Indicated under this Item. 

I I the International search was carried out on the basis of a translation of the International application furnished to this 
Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed In the International application, the International search 
was carried out on the basis of the sequence listing : 

|X| contained In the International application In written form. 

| | filed together wtth the International application In computer readable form. 

| | furnished subsequently to ihte Authority In written form. 

|X] furnished subsequently to this Authority In computer readble form. 

fXj the statement that the subsequently furnished written sequence listing does not go beyond the disclosure In the 
International application as filed has been furnished. 

[X] the statement that the Information recorded In computer readable form Is Identical to the written sequence listing has been 
furnished 



2. [X] Certain claim* were found unse a rchable (See Box 1). 

3. |~| Unity of invention ie lacking (see Box II). 

4. With regard to the title, 

(X] the text Is approved as submitted by the applicant. 

| | the text has been established by this Authority to read as follows: 



5. Wtth regard to the abstract 

[X] the text Is approved as submitted by the applicant. 

□ the text has been established, according to Rule 38.2(b), by this Authority as It appears In Box III. The applicant may, 
wtth In one month from the date of mailing of this International search report submit comments to this Authority. 

6. The figure of the drawings to be publtehed with the abstract Is Figure No. 2 



[X] as suggested by the applicant. Q^j None of the figures. 

| | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the Invention. 



Form PCT/1SA/210 (first sheet) (July 1996) 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/US 99/23731 



Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 

Thte International Search Report has not been established In respect of certain claims under Article 1 7(2)(a) for the following reasons: 



1. 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claims 54-61 

are directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged 
effects of the compound/composition. 

2. Q aalmsNos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be earned out specifically: 



3. Q Claims Nos.: 

because they are dependent claims and are not drafted In accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of Invention Is lacking (Continuation of Item 2 of first sheet) 

This International Searching Authority found multiple Inventions In this International application, as follows: 



1 . I I As aJI required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 



2. | | As all searchable claims could be searched without effort Justifying an addrtional fee, this Authority did not Invite payment 
of any addrtional fee. 



3. I I As only some of the required addrtional search fees were timely paid by the applicant, this International Search Report 
1 — 1 covers only those claims for which fees were paid, specfflcalry claims Nos.: 



4. | | No required addrtional search fees were timely paid by the applicant Consequently, this International Search Report Is 
restricted to the Invention first mentioned In the claims; It Is covered by claims Nos.: 



Remark on Protest [ | The additional search fees were accompanied by the applicants protest 

| | No protest accompanied the payment of addrtional search fees. 
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INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/US 99/23731 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/10 C12N15/31 C12N15/62 C12N15/63 C12N15/67 
C12N15/72 C07K14/245 

According to International Patent Ctaasflcaflon (IPC) or to both national ciaaeiflcarion and IPC 

B. FIELDS SEARCHED 

Minim cm documentation searched (daeaflcatton system to lowed by dasellcaOon symbols) 

IPC 7 C12N C07K 



Documentation searched other than mtilmum documentation to the extent that such documents are Included In the folds searched 



Electronic data base consutted during the International search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 


Ctation of document wth Indcatxxi, where appropriate, of the relevant passages 


Relevant to claim No. 


E 


WO 99 53079 A (SHEN XIAOYU ;TA0 JIANSHI 
(US); TALLY FRANCIS P (US); ZHANG JIANSU) 
21 October 1999 (1999-10-21) 
the whole document 


1,12-20 


p,x 


WO 99 36554 A (UPJOHN CO ;F0RD CHARLES W 
(US); QUINN CHERYL L (US)) 
22 July 1999 (1999-07-22) 
the whole document 


1,12-20 




WO 99 35494 A (CONNELLY GENE ;TA0 JIANSHI 
(US); GALLANT PAUL L (US); TALLY FRANCI) 
15 July 1999 (1999-07-15) 
the whole document 

-/-- 


1,12-120 



Further documents are tsted In the continuation of box C 



Patent family members are tsted in annex. 



9 Special categories of cted documents : 

"A* document defying the general state of the art which Is not 

considered to be of particular relevance 
°E" earlier document but pubtshed on or after the IrrtomatJooaJ 

fling date 

T" document which may throw doubts on priority c*akn<8) or 
which Is cited to establish the publication date of another 
ctatlon or other special reason (as specffled) 

"O* document referring to an oral dtocloaure, use, exhtoltlon or 
other means 

"P" document pubtshed prior to the IrtemationaJ ftt>g date but 
later than the priority date claimed 



T later document published after the ttemattoneJ fttig date 
or priority date and not In contVct wth the apptcatJon but 
cted to understand the princ$le or theory underlying the 
Invention 

"X" document of particular relevance; the claimed fcwentton 
cannot be considered novel or cannot be considered to 
Involve an Inventive step when the document Is taken alone 

*Y" document of particular relevance; the claimed Invention 

cannot be considered to Involve an Inventive step when the 
document Is combined wth one or more other such docu- 
ments, such comb Nation being obvious to a person sidled 
In the art 

doctrnent member of the same patent tamty 



Date of the actual completion of the International search 

23 February 2000 


Date of maJIng of the International search report 

09/03/2000 


Name and malfrig address of the ISA 

European Patent Office. P.B. 5616 Patentiaan 2 
NL - 2280 MV RJswIJk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Hornlg, H 
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(^Continuation) DOCUMENTS CONS** RED TO BE RELEVANT 



Category * 


Citation of doctment wth ln<ic*rJoawt>ere appropriate, of me relevant passages 


Relevant to ciaJm No. 


X 


GIZA P E ET AL: "A SELF-INDUCING 
RUNAWAY-REPLICATION PLASMID EXPRESSION 
SYSTEM UTILIZING THE ROP PROTEIN" 
GENE, NL, ELSEVIER BIOMEDICAL PRESS. 
AMSTERDAM, 
vol. 78, no. 1, 

1 January 1989 (1989-01-01), pages 73-84, 
XP000009825 

ISSN: 0378-1119 
the whole document 


41-45 


A 


D.G. YANSURA AND D.J. HENER: "Use of the 
Escherichia coll lac repressor and 
operator to control gene expression 1n 
Bacillus subtil 1s" 
PROC. NATL. ACAD. SCI., 
vol. 81, 1984, pages 439-443, XP002131397 
NATL. ACAD. SCI . .WASHINGTON, DC, US; 
the whole document 




A 


WO 97 04110 A (S0MAT0GEN INC ;WEICKERT 
MICHAEL J (US); GLASCOCK CHRISTOPHER B 
(US) 6 February 1997 (1997-02-06) 
the whole document 




A 


VANH00F G ET AL: "PROLINE MOTIFS IN 
PEPTIDES AND THEIR BIOLOGICAL PROCESSING" 
FASEB JOURNAL, US, FED. OF AMERICAN SOC. FOR 
EXPERIMENTAL BIOLOGY, BETHESDA, MD, 
vol. 9, no. 9, 

1 January 1995 (1995-01-01), pages 
736-744, XP002093621 

ISSN: 0892-6638 
cited 1n the application 
the whole document 




A 


WO 93 03156 A (UNIV TEXAS) 
18 February 1993 (1993-02-18) 
the whole document 




A 


S.F. BETZ ET AL.: "De novo design of 
native proteins: Characterization of 
proteins Intended to fold Into 
ant1 parallel , ROP-Uke, Four-Hel1x 
bundles" 
BIOCHEMISTRY, 

vol. 36, 1997, pages 2450-2458, 
XP002131398 

AM. CHEM. SOC. .WASHINGTON, DC, US 
cited 1n the application 
the whole document 




A 


WO 92 07071 A (UNIV ILLINOIS) 
30 April 1992 (1992-04-30) 
the whole document 

-/-- 
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Category 0 C Ration of document wth Indteaflon, where appropriate, of the relevant passages 



Relevant to cialm No. 



WO 90 07862 A (PROTEIN ENG CORP) 
26 July 1990 (1990-07-26) 
the whole document 

W0 96 40721 A (HARVARD COLLEGE ;UNIV EMORY 
(US)) 19 December 1996 (1996-12-19) 
the whole document 
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Inf ofTTuMon on patent f Mnity nMfnbwv 



Intamattoturi Application No 

PCT/US 99/23731 



Patent document 




Publication 




Patent family 




Publication 


ctted In search report 




date 




members) 




date 


WO 9953079 


A 


21-10-1999 


AU 


3746599 


A 


A 1 ^ 1 A A A 

01-11-1999 








AU 


2218199 


A 


26-07-1999 








WO 


9935494 


A 


15-07-1999 


WO 9936554 


A 


A A A*f 4 AAA 

22-07-1999 


AU 


AA< A^ A A 

2312699 


A 


02-08-1999 


i ■ s»\ A A. A ^ 4 A A 

WO 9935494 


A 


•i ^- A ^ ^ AAA 

15-07-1999 


AU 


2218199 


A 


26-07-1999 








WO 


9953079 


A 


21-10-1999 








AU 


3746599 


A 


01-11-1999 


U0 9704110 


A 


06-02-1997 


AU 


6489996 


A 


18-02-1997 


U0 9303156 


A 


18-02-1993 


US 


5212083 


A 


18-05-1993 








AU 


2323992 


A 


02-03-1993 


WO 9207071 


A 


30-04-1992 


US 


5217889 


A 


08-06-1993 








AU 


649759 


B 


02-06-1994 








AU 


9025091 


A 


20-05-1992 








CA 


2094144 


A 


20-04-1992 








EP 


0555370 


A 


18-08-1993 








JP 


6502309 


T 


17-03-1994 








NO 


931423 


A 


15-06-1993 








US 


^ A * < A A M 

5811234 


A 


AA AA n AAA 

22-09-1998 








us 


5753432 


A 


19-05-1998 








us 


5665550 


A 


09-09-1997 








us 


5942389 


A 


24-08-1999 








us 


5866327 


A 


02-02-1999 


WO 9007862 


A 


26-07-1990 


us 


5096815 


A 


17-03-1992 








AU 


4958890 


A 


13-08-1990 








CA 


2045516 


A 


07-07-1990 








EP 


0452413 


A 


23-10-1991 








JP 


4504052 


T 


23-07-1992 








US 


5198346 


A 


30-03-1993 


WO 9640721 




A 


_ 

19-12-1996 


US 


5733726 


A 


31-03-1998 








AU 


708058 


B 


A A A 1 AAA 

29-07-1999 








AU 


5967596 


A 


30-12-1996 








CA 


2222941 


A 


19-12-1996 








EP 


0837872 


A 


29-04-1998 








JP 


11506608 


T 


15-06-1999 
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Appioanfc c aggni c flto r*rerBiK» 

235.0001 0201 



mwmaDori* app*ca»on no, 

PCT/US99/23731 



566 NotrflcBtfcn of Tran*n*ttai of lnTwm»iicnt* 
FOR FURTHER ACTION Rituwrinory EXBmmaaon R*pon (Form pct.IPEamiCp 



erf I tog dalo (Otoy/trxvr/fvywJ 
12/10/1999 



13^1CV1098 



mtomafca-iaJ Patent ctassiffcstton ( PC) or natonnJ cWastffcaion and IPC 
C12KJ15M0 



Applicant 

THE UNIVERSITY OF GEORGIA RESEARCH ....Bt al 



1. This Int&mational prelminary examination report has been prepared by this Internationa! Preliminary Examining Authority 
and Is transmitted to the applicant according to Article 36. 

2. This REPORT consists 0T a total of 9 sheets, including thte cover sheet, 

K Thic report is al*o accompanied by ANMEXES, i.e sheets of the description, dams anovor arawlnos which riavo 
bGon amended and are the basis for the report and/or &neets containing: rectllicaiions made Deiore tnts Autnorrty 
(see Ruift 70.15 and Section 607 of the Administrative instructions under the PCT). 

These annexes consist of a total of Jfenaets. 



3, Thic roport contains indications relating to trie foHOWUlQ Items: 



1 


3 


II 


□ 


III 




IV 


8 


V 


E 


VI 


IS 


VII 


□ 


VIH 





citations and explanations au porting ouch stntenwit 



Dale o* suDmissior of the demand 

1 1/0S/2G0G 



Name «nc! malJlog acttrw of the international 
preliminary ejwrtnlng authority: 

s European Patort Office 
/ffl) D^0298 Munch 

Pa>f -49 S6 2309 - 4465 



□ale of compiettor otinia 'epo" 

1 6.02-2001 



Avuthortzed officer 

Biadter. c 

Telephone No +43 83 2339 7300 




Fcrm PCT/|PEA,/400 (oorvw choct) (January 1W4) 
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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No. PCT/US99/23/31 



1. Basle of the report 

1 This report has been drawn on the baste of (substitute shoots wfvcb have boon ium.ehod to the n>c&v*ng Qthcs *r. 
nttponsm to Art invitation under Artkte 14 ar* nstorred to in this report aa *origtnalty filed* and are no! annexed tc 
tr>0 report since they do not contain amertdmsnta (Rutea 70 IS and 70 1 7)./. 
Description, pegec: 

1 -qq cfl originally fllea 

Claims, No.: 

as received on 04/1CV2000 with letter of 04/10/2000 

Drawings, anoats: 

1/9-0/8 ee originally filed 

Sequence listing pert of th» description, page*: 

1-31, ae originally rileo 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language In which the International application was filed, unless otherwise indicated under this item. 

Those elements were available or lurnlshed to mis Authority In the tallowing language: . which Is: 

□ the language of a translation furnished for the purposes of the international eearch (under Rule 23. 1 (b)). 

□ the language of puDlicatlon oT the international application (under Rule 46.3(b)). 

□ tne language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3) 

3. With regard to any nucleotide and/or amino acid aequ«r>c* disclosed In tha international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

S contained in li>e international application in written form. 

□ tiled together with the intemationa 1 application in computer readable form. 

□ lurntsnerj subsequently to this Authority In written form. 

S furnished subsequently to this Authority In computer readable form 

H Tne statement that the subsequently furnished wntten sequence listing does not go beyond th9 disclosure in 

the International application as tiled has Deen furnished 
H Tne statement that the Information recorded In computer readable lorm is identical to the wntten sequence 

listing has been turnlaned. 

4. Th© amendments have resulted in the cancellation at: 



Form =>CT.1 PEA/409 I.Boxefl 1-Vlll, ShMt 1) (July 1S96) 
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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT International application No PCT/US99/23731 



□ trie description, pages: 

H thecdaims, Noa : 61 

□ tne drawings, sneets- 

5 □ Thia report haa been aaiabliarted as if (some of) the amendments had not oeen made, since they nave been 
conaiaered to go beyond the disclosure as Hied (Rde 70.2(c)): 

(Any mplac&ment sheet contatning such amendments must be referred to under item 1 and annexed to fas 
report.) 



6 ACGinonai cfcsarvaflons. tf necessary: 
cap separate sheet 



111. Norv*»tabli*hnrwnt of opinion with regard to novelty, Inventive ©top and industrial applicability 

i . The questions wnener the claimed invention appears to Da novel, to involve an inventive etep (to be non- 
oovious), or to be industrially applicable hove not oeen examined In respect of: 

□ the entire international application. 

claims Mot. 28-00. 



because: 



□ the said international application, or the aaid claims Noe. relate to the following subject matter which does 
not require an international preliminary examination (specify): 



H the description, claims or drawings (indicate particular elements below) or said claims Nos. 29-00 are so 
unclear tnat no meaningful opinion could be formed (specify): 
see separate sheet 

□ the claims, or said claims Nos. are so Inadequately supported by the description that no meaningful opinion 
caul d be formed 

□ no Into r national search ropon hae t>don oetabliehod for tho wid claim s No*. . 

2. A meaningful international preliminary examination report cannot De carried out due to the failure ol the nucleotide 
anoVor amino acid sequence listing to t-oniply with the standard provided for in Annex C of the Administrative 
Instructions. 

□ the written form has not been furnished or does not comply with the standard. 

□ tne computer reaoaDie form hes not been Tumisnea or ooes not comply wrtn the standard. 
W. Lack of unity of Invention 

1, in response to the Invitation to restrict or pay addftlonai tees the applicant has: 
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INTERNATIONAL PRELIMINARY 

EXAMINATION REPORT international application No PCT/US99/23731 



□ restricted trie claims, 

□ paw aMtJoral fees. 

□ part additional fe*s under protest 

□ neither restricted rw paid additional laas 

2. S This Authority found that the requirement of unity ot nventlon ia not complied and chose, accordng to Flute 

©8.1 , not to invito trm applicant to restnct or pay additional tees. 

3. mis Autfonty considers that trie requirement of unrry of Invention In accordance with Rjies 13.1. 13.2 and 13 3 is 

□ complied with. 

H not complied with for the following reaaone. 
*•* aaparata a heat 

4 Consequently, the following parts of the international application ware the subject of international prebmriary 
examination in establishing this report 

□ alt part*. 

H the pans relatlne to ciaima Noe. i -28. 

V. Reasoned statement under Article *a(2) with regard to noveJty, inventive Step or industrial applicability; 
oltailons and oxpl a nations supporting such statement 

t. Statement 

Novelty (N) Y«a: 
No: 

Inventive step (IS) Yes: 
No; 

Industrial applicability (IA) Yes: 

No; 

2. Citations and explanations 
see separate sheet 

VI. Certain documents oited 

1. Certain published document** (Rule 70.10) 

and / or 

2. Non-written disclosures (Rule 70.9) 

Fjrrr PCT/IPEA/40C (Bo job l-Vlli, Sheet 3> (July 1&98} 



Claims 1-26 
Claims 

Clajms 1-28 
Clajrns 

Claims 1-26 
Claims 
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INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



IntematonaJ application No. PCT/US99/2373 1 



see separate aheet 
Vlll. Cenaln observations on me international application 

The loiiowng oD&ervattana on «n« Parity of the claims, description, and drawings or on me question whetner me 
claims ar« fully supported by trie description, are made: 



Fz'v PCTf PEA^40fi (Boxes l-Vll:. 8teet 4} (July 1996' 
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INTERNATIONAL PRELIMINARY International appication No PCT/US99/23731 
EXAMINATION REPORT - SEPARATE SHEET 



Re Item I 

Baal 3 of the report 

4. Additional observation* 

The amendments submitted in the letter of 04.10.2000 comply with the requirements of 
Article 34 PCT. 

Re Item III 

Norvestabllshment of opinion with regard to novelty. Inventive step and 
Industrial applicability 

No international preliminary opinion is given for the subject-matter of claims 29 to 50 
since those claims are directed to peptides and uses of peptides which are not 
technical defined. The feature 'stabilizing group' (claims 30 and 46), 'C-termin us\ N- 
terminus 1 (claim 30), four heiix bundle', 'polypeptide' (claim 41), 'cleavage cite' (claim 
46) 'stabilizing form of the peptide drug' (claim 54) are not true technical features in that 
they do not define a tangible peptide or group of peptides but encompass an infinite 
number of possible alternatives which may have quite different chemical compositions. 
Mo examination can be carried out for such unclear and broad claims and consequently 
the present notification Is only carry out for the subject-matter of claims 1-28 . 

Re Item IV 

Lack of unity of invention 

The present application lacks unity since the common concept which links the claims, 'a 
stable peptide', is not novel (see document D2, page 1 , lines 15-21). Consequently, the 
application splits up into various groups of inventions no longer connected by a 
common concept (the claims encompasses a multitude of products which constitutes 
Innumerable groups of invention). There is no link between all products neither the 
stabilisation features (see above) nor any other technical features. The latter because 
such technical features have not been defined. Consequently only products which 
share a common technical feature which distinguishes them from those of the prior an 
can be maintained In this application. However, even with regard to the description, 
such a common feature is not recognizable. Despite this serious objection, the 
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INTERNATIONAL PRELIMINARY international ap&t^cn Kc PC T/US 9 9/23731 
EXAMINATION REPORT - SEPARATE SHEET 

applicant is not invited to pay additional fees, however this objection will be followed uo 
by the IPEA in the case the application would enter the regional phase (OEB). 

Re Kern V 

Raaeonad statement under Article 35(21 with regard tp novelty. Inventive atep or 
Industrial applicability: citations and explanations supporting such statement 

Cited documents 

Reference is mads to the following documents: 

D1: D.G. YANSURA AND D.J. HENER: 'Use of the Escherichia cofi lac 

repressor and operator to control gene expression in Bacillus eubtiliti PROC. 
NATL. ACAD. SCI., vol. 81 , 1984, p439^43 

D2: WO 93031 56A 



Novelty and Inventive step - Articles 33(2) and 33(3} PCT 

The present application relates to an Intracellular selection system for screening 
antimicrobial peptides resistant to proteases and peptidases, This selection system is 
achieved by expressing in a host cell randomized recombinant peptides under the 
control of a portion of the wild-type E. colt lac promoter/operator region comprising fac 
operator 03, a CAP binding region, the -35 /ac promoter site, the 
-10 lac promoter site, lac operator 01, and the tacZ Shine-Dalgemo sequence 

Document D1 which is considered to represent the most relevant state of the art, 
discloses a system tor expressing recombinant peptides In a host cell under the control 
of E. coll lac promoter/operator (see Abstract, the last paragraph of The Introduction and 
Materials and Methods 'Hybrid promoters'). The subject-matter of the present 

application differs from D1 in that the E. cofi lac promoter/operator comprise© all of the 

lac control region that is contained between the start of the 03 auxiliary operator 
througn the end of the 01 operator, and not only the /ac operator 01 (see Figure 1 of 
the present application in comparison with Figure 1 of Document D1). 

According to the description, the technical effect of using all of the E. coii Ibc control 
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region that is contained between the start of the 03 auxiliary operator through tne end 
of the 01 operator is to obtain a more regulable expression vector i.e an expression 
vector that is more tightly repne6&ible when grown under repressed conditions. 

The problem to be solved by the present Invention may therefore be regarded as 
providing an expression system which would allow a better regulation of the cionea 
genes in order to ktentiry bloactlve peptides. 

The solution to this jjroblern provided by the application is to construct an expression 
vector comprising all of the E. coii lac control region that is contained betwoon the start 
of the 03 auxiliary operator through the end of the 01 operator. 

In view of the above defined problem, the question to be answered for the evaluation of 
the inventive 9tep is whether it was obvioua to construct an expression vector with all of 
the £ coli lac control region that is contained between the start of the 03 auxiliary 
operator through the end of the Ot operator in order to obtain a tightly regulable 
expression system. 

There i$ no indication in the available pnor art for the design of an expression vector 
with all of the E. coli lac control region that is contained between the start of the 03 
auxiliary operator through the end of the 01 operator in order to obtain a tightly 
regulable expression system for identifying bioactive peptides. Consequently, the IPEA 
is of the opinion that it would not be obvious for the person skilled in the art to design 
such an expression system and therefore the subject-matter ot claims 1-28 is regarded 
as inventive In the sense of Article 33{3) PCT 

Industrial applicability - Articles 33(1) and (4) PCT 

For the assessment of the present claims 50-61 on the question whether they are 
Industrially applicable, no unified criteria exist in the PCT Contracting States. The 
patentability can also be dependent upon the formulation of the claims. The EPO, for 
example, does not recognize as industrially applicable the subject- matter of claims to 
the use of a compound in medical treatment, but may allow, however, claims to a 
known compound for first use in medical treatment and the use of such a compound for 
the manufacture of a medicament for a new medical treatment. 
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Re Item VI 

Qcrtain documents cited 

Certain published documents (Rule 70.10) 

Apptocrtwn Nc pu « gallon UK» Fling oato 

Patent Ho (aayrtnvnih/yttr} iaeLWrnotmyeor) (<mySmoWy**r} 

WO90/53Q79 21.10.1999 14.04.1999 14.04.1998 

WOQ9/36554 22.07.1999 12.01.1999 16.01.1998 

W099/35494 15.07.1999 06.01.1999 09.01.1998 



Priority dm (vaiia cJatrn) 



Re Item VHI 

Certain observations on the international application 

The terms comprises at leasf and 'comprising' in claim 1 ('....th© tightly regulatable 
control region comprises at least a portion.... 1 ; '....said portion comprising auxiliary....') 
are too broad and vague and render the scope of this claim unclear. Consequently, 
those terms are considered inadmissible under Anicle 6 PCT. They might be replaced 
by the terms 'consists/consisting of. 
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WHATISCLAIMHDIS:. 

1 . A recombinant method for identifying a bioactive peptide comprising: 

(a) transforming a host cell with an exj .ssion vector comprising a 
tightly regulable control region operably linked to a nucleic acid sequence 

5 encoding a peptide; 

(b) growing the transformed cell under conditions that repress 
expression of the peptide; 

(c ) inducing expression of the peptide in the transformed host cell: 
(d) determining whether expression of the peptide is inhibitory of host 
10 cell growth, wherein inhibition of host cell growth is indicative of the expression 
of a bioactive peptide. 



2. The method of claim 1 wherein the tightly regulable control region of the 
expression vector comprises at least a portion of the wild-type E. coli lac 

15 promoter/operator region, said portion comprising auxiliary lac operator 03, a 
CAP binding region, the -35 lac promoter site, the -10 lac promoter site, lac 
operator 01, lacZ Shine-Dalgarno sequence and a spacer region; and wherein the 
transformed host cell comprises an amount of Lac repressor protein effective to 
repress expression of the peptide during step (b). 

20 

3. The method of claim 2 wherein the host cell is a bacterium. 



4, The method of claim 3 wherein the bacterium is a gram positive bacterium. 
25 5. The method of claim 3 wherein the bacterium is gram negative bacterium. 



6. The method of claim 3 wherein the bacterium is E. coli. 



7. The method of claim 2 wherein the host cell is a microbial pathogen. 



30 
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8. The method of claim 7 wherein the microbial pathogen is a member of a 
genus selected from the group consisting of Streptococcus. Staphylococcus and 
Entcrococcus. 



5 9. The method of claim 2 wherein the expression vector comprising the nucleic 
acid sequence encoding the peptide is a first expression vector, and wherein the 
host cell is further transformed, prior to step (b), with a second expression 
vector comprising a promoter operably linked to a gene encoding a Lac repressor 
protein. 

10 

10. The method of claim 2 wherein the expression vector has the identifying 
characteristics of pLACl 1 (ATCC No. 207108). 

1 1. The method of claim 10 wherein the expression vector is pLACl 1 (ATCC 
15 No. 207108). 

12. The method of claim 1 wherein the host cell comprises proteases or 
peptidases or both. 

20 13. The method of claim 1 wherein the host cell has not been modified to reduce 
or eliminate the expression of naturally expressed proteases or peptidases. 

14. The method of claim 1 wherein the host cell is a prokaryote. 

25 15. The method of claim 1 wherein the host cell is a microbial pathogen. 

16. The method of claim 15 wherein the microbial pathogen is a member of a 
genus selected from the group consisting of Streptococcus, Staphylococcus and 
Entcrococcus. 

30 

17. The method of claim 1 wherein the host cell is a eukaryotic cell. 



1 8. The inethod of claim 1 7 wherein the eukaryotic cell is a mammalian cell. 
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19. The method of claim 17 wherein the eukaryotic cell is a cancer cell. 

20. The method of claim 1 wherein the host cell is a protozoan. 

5 

21. The method of claim 1 wherein the peptide comprises a first stabilizing 
group comprising the N-terminus of the peptide and a second stabilizing group 
comprising the C -terminus of the peptide. 

10 22. The method of claim 21 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein, Pro-, Pro-Pro-. Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 
consisting of a small stable protein. -Pro. -Pro-Pro. -Pro-Xaa and -Pro-Pro-Xaa. 

15 23. The method of claim 22 wherein the small stable protein is selected from the 
group consisting of Rop protein, glutathione sulfotransferase. thioredoxin, 
maltose binding protein and glutathione reductase. 

24. The method of claim 1 w herein the peptide comprises a stabilizing motif. 

20 

25. The method of claim 24 wherein the stabilizing motif comprises a 
hydrophilic a-helix motif. 

26. The method of claim 24 wherein the stabilizing motif comprises an opposite 
25 charge ending motif 

27. The method of claim 1 wherein the peptide comprises a randomized amino 
acid sequence. 

30 28. The method of claim 27 wherein the peptide comprises a first stabilizing 
group comprising the N-terminus of the peptide and a second stabilizing group 
comprising the C-terminus of the peptide. 
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29. The method of claim 27 wherein the peptide comprises a stabilizing motif. 

30. A bioactive peptide comprising a first stabilizing group comprising the N- 
terminus of the bioactive peptide and a second stabilizing group comprising the 

5 C -terminus of the bioactive peptide. 

31. The bioactive peptide of claim 30 wherein the first stabilizing group is 
selected from the group consisting of a small stable protein. Pro-, Pro-Pro-, Xaa- 
Pro- and Xaa-Pro-Pro-; and wherein the second stabilizing group is selected 

10 from the group consisting of a small stable protein. -Pro. -Pro-Pro, -Pro-Xaa and 
-Pro-Pro-Xaa. 

32. The bioactive peptide of claim 31 wherein the small stable protein is 
selected from the group consisting of Rop protein, glutathione sulfotransferase, 

1 5 thioredoxin, maltose binding protein and glutathione reductase. 

33. The bioactive peptide of claim 31 wherein the first stabilizing group is Pro- 
Pro- and the second stabilizing group is -Pro-Pro. 

20 34. The bioactive peptide of claim 30 wherein at least one of the first and 
second stabilizing groups comprises a small stable protein. 

35. The bioactive peptide of claim 34 wherein the small stable protein is a four- 
helix bundle protein. 

25 

36. The bioactive peptide of claim 34 wherein the small stable protein is 
selected from the group consisting of Rop protein, glutathione sulfotransferase, 
thioredoxin, maltose binding protein and glutathione reductase. 

30 37. The bioactive peptide of claim 36 wherein the small stable protein is Rop 
protein. 

38. The bioactive peptide of claim 30 which is an antimicrobial peptide. 
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39. The bioactive peptide of claim 30 which is a therapeutic peptide drug. 

40. A bioactive peptide comprising a plurality of sequential uniformly charged 
5 amino acids comprising the N -terminus of the bioactive peptide and a plurality 

of sequential oppositely charged amino acids comprising the C-terminus of the 
bioactive peptide. 

41 . A fusion protein comprising a four-helix bundle protein and a polypeptide. 

10 

42. The fusion protein of claim 41 wherein the four-helix bundle protein is Rop 
protein. 

43. The fusion protein of claim 42 wherein the polypeptide comprises a 
1 5 bioactive peptide. 

44. The fusion protein of claim 42 wherein the four-helix bundle protein is 
covalently linked at its C-terminus to the N-terminus of the polypeptide. 

20 45. The fusion protein of claim 42 wherein the four-helix bundle protein is 
covalently linked at its N-terminus to the C-terminus of the polypeptide. 

46. A polypeptide comprising: 

a bioactive peptide comprising (a) a first stabilizing group selected from 
25 the group consisting of a small stable protein. -Pro-, -Pro-Pro-. -Xaa-Pro- and - 
Xaa-Pro-Pro- and (b) a second stabilizing group selected from the group 
consisting of a small stable protein, -Pro, -Pro-Pro, -Pro-Xaa and -Pro-Pro-Xaa; 
and 

a cleavage site immediately preceding the first stabilizing group; 
30 wherein the second stabilizing group comprises the C-terminus of the 
polypeptide. 
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47. A polypeptide comprising: 

a bioactive peptide comprising (a) a first stabilizing group selected from 
the group consisting of Pro. Pro-Pro-. Xaa-Pro- and Xaa-Pro-Pro- and (b) a 
second stabilizing group selected from the group consisting of —Pro-. -Pro-Pro-. - 
Pro-Xaa- and -Pro-Pro-Xaa-; and 

a cleavage site immediately following the second stabilizing group; 
wherein the first stabilizing group comprises the N-terminus of the polypeptide. 



10 48. A polypeptide comprising: 

a bioactive peptide comprising a plurality of sequential uniformly 

charged amino acids comprising the N-terminus of the bioactive peptide and a 

plurality of sequential oppositely charged amino acids comprising the C- 

terminus of the bioactive peptide; and 
15 a cleavage site immediately preceding the plurality of sequential 

uniformly charged amino acids. 



49. A polypeptide comprising: 

a bioactive peptide comprising a plurality of sequential uniformly 
20 charged amino acids comprising the N-terminus of the bioactive peptide and a 
plurality of sequential oppositely charged amino acids comprising the C- 
terminus of the bioactive peptide; and 

a cleavage site immediately following the plurality of sequential 
oppositely charged amino acids. 

25 

50. A method for using an antimicrobial peptide comprising: 

covalently linking a first stabilizing group to the N-terminus of the 
antimicrobial peptide and a second stabilizing group to the C-terminus of the 
antimicrobial peptide to yield a stabilized antimicrobial peptide; and 
30 contacting a microbe with the stabilized antimicrobial peptide. 
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5L The method of claim 50 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein. Pro-. Pro-Pro-, Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 
consisting of a small stable protein, -Pro, -Pro-Pro. -Pro-Xaa and -Pro-Pro-Xaa. 

5 

52. The method of claim 50 wherein the first stabilizing group is selected from 
the group consisting of Pro-, Pro-Pro-, Xaa-Pro- and Xaa-Pro-Pro- and the 
second stabilizing group is selected from the group consisting of -Pro, -Pro-Pro, 
-Pro-Xaa and -Pro-Pro-Xaa. 

10 

53. A method for using an antimicrobial peptide comprising: 

covalently linking a plurality of sequential uniformly charged amino 
acids to the N-terminus of the antimicrobial peptide and covalently linking a 
plurality of sequential oppositely charged amino acids to the C-terminus of the 
15 antimicrobial peptide to yield a stabilized antimicrobial peptide; and 

contacting a microbe with the stabilized antimicrobial peptide. 



54. A method for treating a patient having a condition treatable with a peptide 
drug comprising administering to the patient a stabilized form of the peptide 

20 drug. 

55. The method of claim 54 wherein the stabilized form of the peptide drug 
comprises a first stabilizing group comprising the N-terminus of the peptide 
drug and a second stabilizing group comprising the C-terminus of the peptide 

25 drug. 

56. The method of claim 55 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein, Pro-. Pro-Pro-, Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 

30 consisting of a small stable protein, -Pro. -Pro-Pro. -Pro-Xaa and -Pro-Pro-Xaa. 



57. The method of claim 56 wherein the small stable protein is a four-helix 
bundle protein. 
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58. The method of claim 56 wherein the small stable protein is selected from 
the group consisting of Rop protein, glutathione sulfotransferase, thioredoxin. 
maltose binding protein and glutathione reductase. 

5 

59. The method of claim 55 further comprising, prior to administration of the 
stabilized form of the peptide drug, covalently linking the first stabilizing group 
to the N-terminus of a peptide drug and covalently linking the second stabilizing 
group to the C-terminus of the peptide drug to yield the stabilized form of the 

10 peptide drug. 

60. The method of claim 54 wherein the stabilized form of the peptide drug 
comprises an opposite charge ending motif 

15 61 . The method of claim 60 further comprising, prior to administration of the 
stabilized form of the peptide drug, covalently linking a plurality of sequential 
uniformly charged amino acids to the N-terminus of the peptide drug and 
covalently linking a plurality of sequential oppositely charged amino acids 
comprising the C-terminus of the peptide drug to yield the stabilized form of the 

20 peptide drug. 
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STABILIZED BIOACTI VE PEPTIDES AM) METHODS OF 
IDENTIFICATION. SYNTHESIS AND I SE 



This patent application claims the benefit of U.S. Provisional Patent 
Applications Serial Nos. 60 104.013. filed 13 October. 1908. and 00 1 12.150. 
filed 14 December, 1998. 

10 

Background of the Invention 



Bioactive peptides are small peptides that elicit a biological activity. 
Since the discovery of secretin in 1902. over 500 of these peptides which 
15 average 20 amino acids in size have been identified and characterized. They 
have been isolated from a variety of systems, exhibit a wide range of actions, 
and have been utilized as therapeutic agents in the field of medicine and as 
diagnostic tools in both basic and applied research. Tables 1 and 2 list some of 
the best known bioactive peptides. 
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Tabic 1: Bioactive peptides utilized in medicine 



Name 


Isolated From 


Si/e In 
Amino Acids 


Therapeutic Use 


Angiotensin II 


Human Plasma 


8 


Vasoconstrictor 


Brad\ kin in 


Human Plasma 


9 


Vasodilator 


Caerulein 


Prog Skin 


10 


Choleretic Agent 


Calcitonin 


Human Parathyroid Gland 


32 


Calcium Regulator 


Cholecystokinin 


Porcine Intestine 


3 3 


Choleretic Agent 


Corticotropin 


Porcine Pituitar\ Gland 


39 


Hormone 


hledoisin 


Octopod V enom 


1 ! 


Hypotensive Agent 


Gastrin 


Porcine Stomach 


1 7 


Gastric Activator 


Glucagon 


Porcine Pancreas 


29 


Antidiabetic Agent 


Gramicidin D 


Bacillus b rev is Bacteria 


1 1 


Antibacterial Agent 


Insulin 
Insulin A 
Insulin B 


Canine Pancreas 


21 

30 


Antidiabetic Agent 


Kallidin 


Human Plasma 


10 


Vasodilator 


Luteinizing 
Hormone- 
Releasing Factor 


Bovine Hypothalamus 


10 


Hormone Stimulator 


Melirtin 


Bee Venom 


26 


Antirheumatic Agent 


Oxytocin 


Bovine Pituitary Gland 


9 


Oxytocic Agent 


Secretin 


Canine intestine 


27 


1 lormone 


Sermorelm 


Human Pancreas J 29 


Hormone Stimulator 


Somatostatin 


Bovine Hypothalamus 


N 


Hormone Inhibitor 


Vasopressin 


Bovine Pituitary Gland 


9 


Antidiuretic Agent 
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1 able 2: Bioactive peptides utilized in applied research 



Name 


Isolated From 


Size In 
Amino Acids 


Biological Activity 


Atrial Natriuretic 
Peptide 


Rat Atria 


28 


Natriuretic .Agent 


Bombesin | Frog Skin 


14 


Gastric Activator 


Conantokin G 


Snail Venom 


17 


Neurotransmitter 


Conotoxin Gl 


Snail Venom 


13 


Neuromuscular Inhibitor 


Detenstn HNP-1 


Human Neutrophils 


30 


Ami m icrobial Agent 


Delta Sleep- 
Inducing 
Peptide 


Rabbit Brain 




N e u ro 1 og i c a 1 A tTec t or 


Dermaseptin 


Frog Skin 


34 


Antimicrobial Agent 


Dynorphin 


Porcine Brain 


17 


Neurotransmitter 


EETI II 


Ech allium elaierium 
seeds 


29 


Protease Inhibitor 


Endorphin 


Human Brain 


30 


Neurotransmitter 


Enkephalin 


Human Brain 


5 


Neurotransmitter 


Histatin 5 


Human Saliva 


24 


Antibacterial Agent 


Mastoparan 


Vespid Wasps 


14 


Mast Cell Degranuiator 


Magainin I 


Frog Skin 


23 


Antimicrobial Agent 


Melanocyte 
Stimulating 
Hormone 


Porcine Pituitary Gland 


13 


Hormone Stimulator 


Moulin 


Canine Intestine 




Gastric Activator 


Neurotensin 


Bovine Brain 


13 


Neurotransmitter 


Physalaemin 


Frog Skin 


1 1 


Hypotensive Agent 


Substance P 


Horse Intestine 


1 1 


Vasodilator 


Vasoactive 
Intestinal 
Peptide 


Porcine Intestine 


28 


Hormone 
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Where the mode of action oi these peptides has beer, determined, it has been 
found to be due to the interaction of the bioaeme peptide with a specific protein 
target. In most of the cases, the bioactive peptide acts by binding to and 
inactivating its protein target with extremely high specificities. Binding 
5 constants of these peptides for their protein targets t>picall> have been 

determined to be in the nanomolar (nM. lO -9 M > range with binding constants as 
high as \0' 12 M (picomolar range) having been reported. Table 3 shows target 
proteins inactivated by several different bioactive peptides as well as the binding 
constants associated with binding thereto. 

10 



Tabic 3: Binding constants of bioactive peptides 



Bioactive 

W It t ■ / 1 * » 

i triJUUL 


Size in 

/Mil III U 

Acids 


Inhibited 
Protein 


Binding 
Constant 


ot-Conotoxin GIA 


15 


Nicotinic Acetylcholine 
Receptor 


l.OxlO -9 M 


EETI II 


29 


Trypsin 


I . 0x10" 12 M 


H2 (7-15) 


8 


HSV Ribonucleotide 
Reductase 


3.6xl0~ 5 M 


Histatin 5 


24 


Bacteroides gingivals 
Protease 


5.5xl0~ 8 M 


Melittin 


26 


Calmodulin 


3.0X10 -9 M 


Myotoxin (29-42) 


14 


ATPase 


1.9x10" 5 M 


Neurotensin 


13 


Ni Regulator) Protein 


5.6X10 -11 M 


Pituitary Adenylate 
Cyclase Activating 
Polypeptide 


38 


Calmodulin 


l.SxlO" 8 M 


PKI (5-24) 


20 


cAMP- Dependent Protein 
Kinase 


2.3xl0" 3 M 


SCP (153-180) 


-» - 


Calpain 


3.0xl0" 8 M 


Secretin 


^ 7 


HSR G Protein 


3.2X10" 5 K 


Vasoactive Intestinal 
Peptide 


28 


GPRN1 G Protein 


2. 5x10" 3 K 
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Recently, there has been an increasing interest in employing synthetical!) 
derived bioactive peptides as novel pharmaceutical agents due to the impressive 
ability of the naturally occurring peptides to bind to and inhibit specific protein 
targets. Synthetically derived peptides could be useful in the development of 
5 new antibacterial, antiviral, and anticancer agents. Examples of synthetically 
derived antibacterial or antiviral peptide agents would be those capable of 
binding to and preventing bacterial or viral surface proteins from interacting with 
their host cell receptors, or preventing the action of specific toxin or protease 
proteins. Examples of anticancer agents would include synthetically derived 

10 peptides that could bind to and prevent the action of specific oncogenic proteins. 

To date, novel bioactive peptides have been engineered through the use 
of tw o different in vitro approaches. The first approach produces candidate 
peptides by chemically synthesizing a randomized library of 6-10 amino acid 
peptides (J. Eichler et aL Med. Res. Rev. 15:481-496 (1995); K. Lam, 

15 Anticancer Drug Des. 12:145-167 (1996); M. Lebl et aL Methods Enzymol. 
289:336-392 (1997)). In the second approach, candidate peptides are 
synthesized by cloning a randomized oligonucleotide library into a Ff 
filamentous phage gene, which allows peptides that are much larger in size to be 
expressed on the surface of the bacteriophage (H. Lowman, Ann. Rev. Biophys. 

20 Biomol. Struct. 26:401-424 (1997); G. Smith et aL et al. Meth. Enz. 217:228- 
257 (1 993 )). To date, randomized peptide libraries up to 38 amino acids in 
length have been made, and longer peptides are likely achievable using this 
system. The peptide libraries that are produced using either of these strategies 
are then typically mixed with a preselected matrix-bound protein target. 

25 Peptides that bind are eluted. and their sequences are determined, f rom this 
information new peptides are synthesized and their inhibitory properties are 
determined. This is a tedious process that only screens for one biological 
activity at a time. 

Although these in viiro approaches show promise, the use of 

30 synthetically derived peptides has not yet become a mainstay in the 

pharmaceutical industry. The primary obstacle remaining is that of peptide 
instability within the biological system of interest as evidenced by the unwanted 
degradation of potential peptide drugs by proteases and or petvi.lases in the L\>t 
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cells. There are three major classes of peptidases which, can degrade larger 
peptides: amino and carboxy exopeptidases which act at either the amino or the 
carboxy terminal end of the peptide, respectively, and endopeptidases which act 
on an internal portion of the peptide. Aminopeptidases. carbox> peptidases, and 
5 endopeptidases have been identified in both prokaryotic and eukar\otie cells. 
Many of those that have been extensively characterized were found to function 
similarly in both cell types. Interestingly, in both prokaryotic and eukaryotic 
systems, many more aminopeptidases than carboxypeptidases have been 
identified to date. 

10 Approaches used to address the problem of peptide degradation have 

included the use of D -ami no acids or modified amino acids as opposed to the 
naturally occurring L-amino acids (e.g., J. Eichler et al.. Med Res Rev. 15:481- 
496 (1995); L. Sanders. Eur. J. Drug Metabol. Pharmacokinetics 15: 95-102 
(1990)), the use of cyclized peptides (e.g.. R. Egleton, et al.. Peptides 18:1431- 

15 1439 (1997)), and the development of enhanced delivery systems that prevent 
degradation of a peptide before it reaches its target in a patient (e.g.. L. Wearley, 
Crit. Rev. Ther. Drug Carrier Syst. 8: 331-394 (1991); L. Sanders, Eur. J. Drug 
Metabol. Pharmacokinetics 15: 95-102 (1990)). Although these approaches for 
stabilizing peptides and thereby preventing their unwanted degradation in the 

20 biosystem of choice (e.g., a patient) are promising, there remains no way to 
routinely and reliably stabilize peptide drugs and drug candidates. Moreover, 
many of the existing stabilization and delivery methods cannot be directly 
utilized in the screening and development of novel useful bioactive peptides. A 
biological approach that would serve as both a method of stabilizing peptides 

25 and a method for identifying novel bioactive peptides would represent a much 
needed advance in the field of peptide drug development. 
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Summan of the 1 m ention 

The present invention provides an intracellular screening method tor 
identifying novel hioactive peptides. A host cell is transformed with an 
5 expression \ector comprising a tightly regulable control region operably linked 
to a nucleic acid sequence encoding a peptide. The transformed host cell is first 
grown under conditions that repress expression of the peptide and then, 
subsequently, expression of the peptide is induced. Phenotypic changes in the 
host cell upon expression of the peptide are indicative of bioactivity. and are 

1 0 evaluated. If. for example, expression of the peptide is accompanied by 
inhibition of host cell growth, the expressed peptide constitutes a bioactive 
peptide, in that it functions as an inhibitory peptide. 

Intracellular identification of bioactive peptides can be advantageously 
carried out in a pathogenic microbial host cell. Bioactive peptides having 

1 5 antimicrobial activity arc readily identified in a microbial host cell system. 

Further, the method can be carried out in a host cell that has not been modified to 
reduce or eliminate the expression of naturally expressed proteases or peptidases. 
When carried out in a host cell comprising proteases and peptides, the selection 
process of the invention is biased in favor of bioactive peptides that are protease- 

20 and peptidase-resistant. 

The tightly regulable control region of the expression vector used to 
transform the host cell according to the invention is preferably derived from the 
wild-type Escherichia coli lac operon. and the transformed host cell preferably 
comprises an amount of Lac repressor protein effective to repress expression of 

25 the peptide during host cell growth under repressed conditions. To insure a 

sufficient amount of Lac repressor protein, the host cell can be transformed with 
a second vector the overproduces Lac repressor protein. 

Optionally, the expression vector used to transform the host cell can be 
genetically engineered to encode a stabilized peptide that is resistant to 

30 peptidases and proteases, f or example, the coding sequence can be designed to 
encode a stabilizing group at cither or both of the peptide N-terminus or C- 
lerminus. As another example, the coding sequence can be designed to encode a 
stablizing motif such as an a-helix motif or an opposite charge ending ri'Oiif. as 
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described be Km. The presence of a stabilizing group ai a peptide terminus or a 
stabilizing motif can slow dow n the rate of intracellular degradation of the 
peptide. 

The invention further provides a bioacme peptide having a first 
5 stabilizing group comprising the N-terminus and a second stabilizing group 
comprising the C-terminus. Preferably, the first stabilizing group is selected 
trom the group consisting of a small stable protein. Pro-. Pro-Pro-. Xaa-Pro- and 
Xaa-Pro-Pro-; and the second stabilizing group is selected from the group 
consisting of a small stable protein. -Pro. -Pro-Pro. -Pro-Xaa and -Pro-Pro-Xaa. 

10 Suitable small stable proteins include Rop protein, glutathione sulfotransferase. 
thioredoxin, maltose binding protein, and glutathione reductase. In addition, the 
invention provides a bioactive peptide stabilized by an opposite charge ending 
motif, as described below. The bioactive peptide is preferably an antimicrobial 
peptide or a therapeutic peptide drug. 

1 5 Also provided by the invention is a polypeptide that can be cleaved to 

yield a bioactive peptide having a stabilizing group at either or both of its N- and 
C-termini. The cleavable polypeptide accordingly comprises a chemical or 
enzymatic cleavage site either immediately preceding the N-tcrminus of the 
bioactive peptide or immediately follow ing the C-terminus of the bioactive 

20 peptide. 

The invention further provides a fusion protein comprising a four-helix 
bundle protein, preferably the Rop protein, and a polypeptide. The four-helix 
bundle protein is positioned at cither the N-terminus or the C-terminus of the 
fusion protein, and accordingly can be fused to either the N-terminus or the C- 

25 terminus of the polypeptide. 

The present invention also provides a method for using an antimicrobial 
peptide. An antimicrobial peptide is stabilized by linking a first stabilizing 
group to the N-terminus of an antimicrobial peptide, and. optionally, a second 
stabilizing group to the C-terminus of the antimicrobial peptide. Alternatively, 

30 the antimicrobial peptide is stabilized by flanking the peptide sequence with an 
opposite charge ending motif, as described below. The resulting stabilized 
antimicrobial peptide is brought into contact w ith a microbe, preferably a 
pathogenic microbe, for example to inhibit the growth r. toxicity of the microbe. 
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1 he invention also provides a method lor treating a patient having a 
eondition treatable with a peptide drug, comprising administering to the patient a 
stabilized peptide drug having at least one of a first stabilizing group comprising 
the N-terminus of the stabilized peptide drug and a second stabilizing group 
5 comprising the C-terminus of the stabilized peptide drug. Optionally, prior to 
administration of the stabilized peptide drug, the first stabilizing group is 
covalcntly linked to the N-terminus of a peptide drug, and the second stabilizing 
group is covalcntly linked to the C-terminus of the peptide drug to yield the 
stabilized peptide drug. Alternatively, the method comprises administering to 
10 the patient a peptide drug that has been stabilized by Hanking the peptide 
sequence with an opposite charge ending motif, as described below. 

Brief Description of the Drawing s 

15 Figure 1 shows the control region (SEQ ID NO:l ) of the w ild-type lac 

operon from the auxiliary operator 03 through the translational start of the lacZ 
gene. DNA binding sites include the operators 03 and Ol (both underlined), 
CAP (boxed), the -35 site (boxed), and the -10 site (boxed), while important 
RNA and protein sites include the LacI translation stop site (TGA), the -1 lacZ 

20 transcription start site, the Shine Dalgarno (SD) ribosome binding site for lacZ. 
and the LacZ translation start site (ATG). 

Figure 2 is a map of plasmid pLACl 1. The unique restriction sites and 
the base pair at w hich they cut are indicated. Other sites of interest are also 
shown, including l et (98-1288). Rop (1931-2122). on (2551-3138). Amp (3309- 

25 4169). and lacPO (4424-4536). 

Figure 3 is a map of plasmid pLAC22. The unique restriction sites and 
the base pair at which they cut are indicated. Other sites of interest are also 
shown, including Tet (98-1288). Rop (1927-21 18). ori (2547-3134). Amp (3305- 
4165), lacM (4452-5536). and lacPO (5529-5641 ). 

30 Figure 4 is a map of plasmid pLAC33. The unique restriction sites and 

the base pair at which they cut are indicated. Other sites of interest are also 
shown, including Tet (98-1288). ori (1746-2333). Amp (2504-3364), and lacY'Q 
(3619-3731 ). 
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Figure 5 shows the response of the pl.AC'l 1 ~Lu Z construct (open circles ) 
to varying amounts of isopropyl [}-D-thiogalactoside (IPTGi. A tilled square 
indicates the [Vgalactosidase activ ity that was obtained when MG1655 or 
CSH27 cells were grown in rich media induced with 1 mM IPTG. while a filled 
5 diamond indicates the (i-galaetosidase activity that was obtained when MG165 5 
or CSH27 cells were grown in M c ) minimal lactose media. 

Figure 6 shows growth curves depicting the inhibitory effects of a two 
day inhibitor (pPepl2) versus a one day inhibitor ipPepl ). Data points for the 
control. pLACl 1, for pPepl , and for pPepl 2. are indicated by squares, circles. 
10 and triangles, respectively. 

Figure 7 is a map of the p-Rop(C) fusion vector. The unique restriction 
sites and the base pair at which they cut are indicated. Other sites of interest are 
also shown, including Rop (7-198). ori (627-1214). Amp (2245-1385). lacPO 
(2500-2612). 

15 Figure 8 is a map of the p(Ts T )Rop-fusion vector. The unique restriction 

sites and the base pair at which the} cut are indicated. Other sites of interest are 
also shown; Rop (7-204). ori (266-853). Amp (1024-1884). lacPO (2139-2251 ). 

Figure 9 illustrates a peptide (SEQ ID NO:2) having the opposite charge 
ending motif, wherein the amino and carboxy termini of the peptide are stabilized 

20 by the interactions of the opposite charge ending amino acids. 

Detailed Description 

The present invention represents a significant advance in the art of 
25 peptide drug development by allowing concurrent screening for peptide 

bioactivity and stability. Randomized recombinant peptides are screened tor 
bioactivity in a tightly regulated inducible expression system, preferably derived 
from the wild-type lac operon. that permits essentially complete repression of 
peptide expression in the host cell. Subsequent induction of peptide expression 
30 can then be used to identify peptides that inhibit host cell growth or possess 
other bioactivities. 

Intracellular screening of randomized peptides has many advantages over 
existing methods. Bioactivity is readily apn o .::,ni. many di'. jrse bioactivities can 
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be screened lor simultaneously, very large numbers or" peptides can be screened 
using easily generated peptide libraries, and the host ceil, if desired, can be 
genetically manipulated to elucidate an affected protein target. Advantageous!} . 
randomized peptides can be screened in a host cell that is identical to or closely 
5 resembles the eventual target cell tor antimicrobial applications. An additional 
and very important feature of this system is that selection is naturally biased in 
favor of peptides that are stable in an intracellular environment; i.e.. that are 
resistant to proteases and peptidases. Fortuitously bacterial peptidases are very 
similar to eukaryotic peptidases. Peptides that are stable in a bacterial host are 

10 thus likely to be stable in a eukaryotic cell as well, allowing bacterial cells to be 
used in initial screens to identify drugs that may eventually prove useful as 
human or animal therapeutics. 

The invention is directed to the identification and use of bioactive 
peptides. A bioactive peptide is a peptide having a biological activity. The term 

1 5 "bioactivity" as used herein includes, but is not limited to. any type of interaction 
with another biomolecule. such as a protein, glycoprotein, carbohydrate, for 
example an oligosaccharide or polysaccharide, nucleotide, polynucleotide, fatty 
acid, hormone, enzyme, eofactor or the like, whether the interactions involve 
covalent or noncovalent binding. Bioactivity further includes interactions of 

20 any type with other cellular components or constituents including salts, ions, 
metals, nutrients, foreign or exogenous agents present in a cell such as viruses, 
phage and the like, for example binding, sequestration or transport-related 
interactions. Bioactivity of a peptide can be detected, for example, by observing 
phenotypic effects in a host cell in which it is expressed, or by performing an in 

25 vitro assay for a particular bioactivity. such as affinity binding to a target 

molecule, alteration of an enzymatic activity, or the like. Examples of bioactive 
peptides include antimicrobial peptides and peptide drugs. Antimicrobial 
peptides are peptides that adversely affect a microbe such as a bacterium, virus, 
protozoan, or the like. Antimicrobial peptides include, for example, inhibitor) 

30 peptides that slow the growth of a microbe, microbiocidal peptides that are 
effective to kill a microbe (e.g.. bacteriocidal and virocidal peptide drugs, 
sterilants, and disinfectants), and peptides effective to interfere with microbial 
reproduction, host toxicity, or the like. Pen'iic drugs fo r therapeutic use in 
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humans or oilier animals include, for example, antimicrobial peptides thai are no! 
prohibitively toxic to the patient, peptides designed to elicit, speed up. slow 
down, or prevent various metabolic processes in the hobt such as insulin, 
oxytocin, calcitonin, gastrin, somatostatin, anticancer peptides, and the like. 
5 The term "peptide" as used herein refers to a plurality of amino acids 

joined together in a linear chain via peptide bonds. Accordingly, the term 
"peptide" as used herein includes a dipeptide. tripeptide. oligopeptide and 
polypeptide. A dipeptide contains two amino acids; a tripeptide contains three 
amino acids; and the term oligopeptide is typically used to describe peptides 

10 having between 2 and about 50 or more amino acids. Peptides larger than about 
50 are often referred to polypeptides or proteins. For purposes of the present 
invention, a "peptide" is not limited to an\ particular number of amino acids. 
Preferably, however, the peptide contains about 2 to about 50 amino acids, more 
preferably about 5 to about 40 amino acids, most preferably about 5 to about 20 

1 5 amino acids. 

The library used to transform the host cell is formed by cloning a 
randomized, peptide-encoding oligonucleotide into a nucleic acid construct 
having a tightly regulable expression control region. An expression control 
region can be readily evaluated to determine whether it is "tightly regulable," as 

20 the term is used herein, by bioassay in a host cell engineered to contain a mutant 
nonfunctional gene "X." Transforming the engineered host cell with an 
expression vector containing a tightly regulable expression control region 
operably linked to a cloned wild-type gene "X" will preserve the phenotype of 
the engineered host cell under repressed conditions. Under induced conditions, 

25 however, the expression vector containing the tightly regulable expression 
control region that is operably linked to the cloned wild-type gene "X" will 
complement the mutant nonfunctional gene X to yield the wild-type phenotype. 
In other words, a host cell containing a null mutation which is transformed with 
a tightly regulable expression vector capable of expressing the chromosomal!} 

30 inactivated gene will exhibit the null phenotype under repressed conditions: but 
when expression is induced the cell will exhibit a phenotype indistinguishable 
from the wild-type cell. It should be understood that xh^ expression control 
region in the tightly regulable ^'^ression ve;v ; of the present invention can be 
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readily modified to produce higher levels of an encoded biopeptide. if desired 
(see. e.g.. Example I. below ). Such modification may unaxoidably introduce 
some "leakiness" into expression control, resulting in a low level of peptide 
expression under repressed conditions. 
5 In a preferred embodiment, the expression control region of the inducible 

expression vector is derived from the wild-type E. coli lac promoter operator 
region. In a particularly preferred form, the expression vector contains a 
regulator}' region that includes the auxiliary operator 03. the CAP binding 
region, the -35 promoter site, the -10 promoter site, the operator Ol. the Shine- 

1 0 Dalgarno sequence for IacZ, and a spacer region between the end of the Shine- 
Dalgarno sequence and the ATG start of the IacZ coding sequence (see Fig. 1 ). 

It is to be understood that variations in the wild-type nucleic acid 
sequence of the lac promoter/operator region can be tolerated in the expression 
control region of the preferred expression vector and are encompassed by the 

15 invention, provided that the expression control region remains tightly regulable 
as defined herein. For example, the -10 site of the w ild-type lac operon 
(TATGTT) is weak compared to the bacterial consensus -10 site sequence 
TATA AT, sharing four out of six positions. It is contemplated that other 
comparably weak promoters are equally effective at the -10 site in the 

20 expression control region: a strong promoter is to be avoided in order to insure 
complete repression in the uninduced state. With respect to the -35 region, the 
sequence of the wild-type lac operon, TTTACA. is one base removed from the 
consensus -35 sequence TTGACA. It is contemplated that a tightly regulable 
lac operon-derived expression control region could be constructed using a 

25 weaker -35 sequence (i.e.. one having less identity with the consensus -35 

sequence) and a wild-type -10 sequence (TATA AT), yielding a weak promoter 
that needs the assistance of the CAP activator protein. Similarly, it is to be 
understood that the nucleic acid sequence of the CAP binding region can be 
altered as long as the CAP protein binds to it with essentially the same affinity. 

30 The spacer region between the end of the Shine-Dalgarno sequence and the ATG 
start of the IacZ coding sequence is typically between about 5 and about 10 
nucleotides in length, preferably about 5 to about 8 nucleotides in length, more 
preferably about 7-9 nucleotide c in length. Th.: most preferred composition and 
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length of the spacer region depends on the composition and length of Shine- 
Dalgarno sequence with which it is operahly linked as well as live translation 
start codon employed (i.e.. AUG. GCG. or UUG ). and can be determined 
accordingly hy one of skill in the art. Preferably the nucleotide composition of 
5 the spacer region is "AT rich": that is. it contains more .Vs and T's than it does 
G's and C's. 

In a preferred embodiment of the method of the invention, the expression 
vector has the identifying characteristics of pLACl 1 (ATCC No. 207108). More 
preferably, the expression vector is pLACl 1 (ATCC No. 207108). 

10 As used in the present invention, the term "vector" is to be broadly 

interpreted as including a plasmid. including an episome. a viral vector, a 
cosmid, or the like. A vector can be circular or linear, single-stranded or double- 
stranded, and can comprise RNA. DNA. or modifications and combinations 
thereof. Selection of a vector or plasmid backbone depends upon a variety of 

15 characteristics desired in the resulting construct, such as selection marker(s). 
plasmid copy number, and the like. A nucleic acid sequence is "operably 
linked" to an expression control sequence in the regulatory region of a vector, 
such as a promoter, when the expression control sequence controls or regulates 
the transcription and/or the translation of that nucleic acid sequence. A nucleic 

20 acid that is "operably linked" to an expression control sequence includes, for 
example, an appropriate start signal (e.g., ATG) at the beginning of the nucleic 
acid sequence to be expressed and a reading frame that permits expression of the 
nucleic acid sequence under control of the expression control sequence to yield 
production of the encoded peptide. The regulators region of the expression 

25 vector optionally includes a termination sequence, such as a codon for which 
there is no corresponding aminoacetyl-tRNA, thus ending peptide synthesis. 
Typically, when the ribosome reaches a termination sequence or codon during 
translation of the mRNA. the polypeptide is released and the ribosome-mRNA- 
tRNA complex dissociates. 

30 An expression vector optional ly includes one or more selection or marker 

sequences, which typically encode an enzyme capable of inactivating a 
compound in the growih medium. The inclusion of a marker sequence can. for 
example, render the hos' *;cii resistan* <o an antibiotic, or it can confer a 
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compound-specific metabolic advantage on the host cell. Celi> can he 
transformed with the expression \ector using an> convenient method known in 
the art. including chemical transformation, e.g.. whereby cells are made 
competent by treatment with reagents such as CaCb: electroporation and other 
electrical techniques: microinjection and the like. 

In embodiments of the method that make use of a tightU regulable 
expression system derived from the Lie operon. the host cell is or has been 
genetically engineered or otherwise altered to contain a source of Lac repressor 
protein in excess of the amount produced in wild-type E. coll. A host cell that 
contains an excess source of Lac repressor protein is one that expresses an 
amount of Lac repressor protein sufficient to repress expression of the peptide 
under repressed conditions, i.e.. in the absence of an inducing agent, such as 
isopropyl (^-D-thiogalactoside (IPTG). Preferably, expression of Lac repressor 
protein is constitutive. For example, the host cell can be transformed with a 
second vector comprising a gene encoding Lac repressor protein, preferably lacL 
more preferably lacE\ to provide an excess source of Lac repressor protein in 
trans, i.e.. extraneous to the tightly regulable expression vector. An episome can 
also serve as a trans source of Lac repressor. Another option for providing a 
trans source of Lac repressor protein is the host chromosome itself, which can be 
genetically engineered to express excess Lac repressor protein. Alternatively, a 
gene encoding Lac repressor protein can be included on the tightly regulable 
expression vector that contains the peptide-encoding oligonucleotide so that Lac 
repressor protein is provided in cis. The gene encoding the Lac repressor protein 
is preferably under the control of a constitutive promoter. 

The invention is not intended to be limited in any way by the type of host 
cell used for screening. The host cell can be a prokaryotic or a eukaryotic cell. 
Preferred mammalian cells include human cells, of any tissue type, and can 
include cancer cells or hybridomas. without limitation. Preferred bacterial host 
cells include gram negative bacteria, such as E. coli and various Salmonella spp.. 
and gram positive bacteria, such as bacteria from the genera Staphylococcus. 
Streptococcus and Entcrococcus. Protozoan cells are also suitable host cells. In 
clear contrast to conventional recombinant protein expression systems, it is 
preferable that the Iv.^i. cell conrvi.., proteases and or peptidases, since the 
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selection will, as a result, be advantageously hia>ed in !a\or of peptides that are 
protease- and peptidase-resistant. More preferably, the host cell has no; been 
modified, genetically or otherw ise, to reduce or eliminate the expression of any 
naturally expressed proteases or peptidases. The host cell can be selected with a 
5 particular purpose in mind. For example, if it is desired to obtain peptide drugs 
specific to inhibit Staphylococcus, peptides can be advantageously expressed and 
screened in Staphylococcus. 

There is. accordingly, tremendous potential for the application of this 
technology in the development of new antibacterial peptides useful to treat 

1 0 various pathogenic bacteria. Of particular interest are pathogenic Staphylococci. 
Streptococci, and Entcrococci. which are the primary causes of nosocomial 
infections. Many of these strains are becoming increasingly drug-resistant at an 
alarming rate. The technology of the present invention can be practiced in a 
pathogenic host cell to isolate inhibitor peptides that specifically target the 

15 pathogenic strain of choice. Inhibitory peptides identified using pathogenic 

microbial host cells in accordance with the invention may have direct therapeutic 
utility; based on what is known about peptide import, it is very likely that small 
peptides are rapidly taken up by Staphylococci. Streptococci, and Enterococci. 
Once internalized, the inhibitory peptides identified according to the invention 

20 would be expected to inhibit the growth of the bacteria in question. It is 

therefore contemplated that novel inhibitor peptides so identified can be used in 
medical treatments and therapies directed against microbial infection. It is 
further contemplated that these novel inhibitor peptides can be used, in turn, to 
identify additional novel antibacterial peptides using a synthetic approach. The 

25 coding sequence of the inhibitory peptides is determined, and peptides are then 
chemically synthesized and tested in the host cell for their inhibitory properties. 

Novel inhibitor peptides identified in a pathogenic microbial host cell 
according to the invention can also be used to elucidate potential new drug 
targets. The protein target that the inhibitor peptide inactivated is identified 

30 using reverse genetics by isolating mutants that are no longer inhibited by the 
peptide. These mutants are then mapped in order to precisely determine the 
protein target that is inhibited. New antibacterial drugs can then be developed 
using various known or ye; * ; oe discover ■ j pharmaceutical strategies. 
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Following transformation of the host cell, the transformed host ceil i> 
initially grown under conditions that repress expression of the peptide. 
Fxpression of the peptide is then induced. For example, when a Lie 
promoter operator system is used for expression. IPTCi is added to the culture 
5 medium. A determination is subsequently made as to whether the peptide is 
inhibitory to host cell growth, wherein inhibition of host cell growth under 
induced but not repressed conditions is indicative of the expression of a 
bioactive peptide. 

Notably, the bioactive peptides identified according to the method of the 

10 invention are, by reason of the method itself, stable in the intracellular 

environment of the host cell. The method of the invention thus preferably 
identifies bioactive peptides that are resistant to proteases and peptidases. 
Resistance to proteases and peptidases can be evaluated by measuring peptide 
degradation when in contact with appropriate cell extracts or purified peptidases 

1 5 and/or proteases, employing methods well-known in the art. A protease- or 

peptidase-resistant peptide is evidenced by a longer half-life in the presence of 
proteases or peptidases compared to a control peptide. 

Randomized peptides used in the screening method of the invention can 
be optionally engineered according to the method of the invention in a biased 

20 synthesis to increase their stability by making one or both of the N-terminal or 
C-terminal ends more resistant to proteases and peptidases, and/or by 
engineering into the peptides a stabilizing motif 

In one embodiment of the screening method of the invention, the putative 
bioactive peptide is stabilized by adding a stabilizing group to the N-terminus. 

25 the C-tcrminus, or to both termini. To this end, the nucleic acid sequence that 

encodes the randomized peptide in the expression vector or the expression vector 
itself is preferably modified to encode a first stabilizing group comprising the N- 
terminus of the peptide, and a second stabilizing group comprising the C- 
terminus of the peptide. 

30 The stabilizing group can be a stable protein, preferably a small stable 

protein such as thioredoxin. glutathione sulfotransferase. maltose binding 
protein, glutathione reductase, or a four-helix butxiie protein such as Rop 
protein, although no specif e size limitation on the protein anchor is intended. 
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Proteins suitable for use as a stabilizing group can be either naiuralK occurring 
or non-naturally occurring. They can be isolated from an endogenous source, 
chemically or enzymatically synthesized, or produced using recombinant DNA 
technology. Proteins that are particularly well-suited lor use as a stabilizing 
5 group are those that are relatively short in length and form \ er> stable structures 
in solution. Proteins having molecular weights of less than about 50 kD are 
preferred for use as a stabilizing group; more preferably the molecular weight of 
the small stable protein is less than about 25 kD. most preferably less than about 
12 kD. For example. E. co/i thioredoxin has a molecular w eight about 1 1 .7 kD; 

10 E. coli glutathione sulfotransferase has a molecular w eight of about 22. 9 kD. and 
Rop from the ColEl replicon has a molecular w eight of about 7.2 kD; and 
maltose binding protein (without its signal sequence) is about 40.7 kD. The 
small size of the Rop protein makes it especially useful as a stabilizing group, 
fusion partner, or peptide anchor, in that it is less likely than larger proteins to 

1 5 interfere with the accessibility of the linked peptide, thus preserving its 

bioactivity. Rop's highly ordered anti-parallel four-helix bundle topology (after 
dimerization) and slow unfolding kinetics ( see, e.g.. Betz et ah Biochemistry 36, 
2450-2458 (1997)) also contribute to its usefulness as a peptide anchor according 
to the invention. Other proteins with similar folding kinetics and/or 

20 thermodynamic stability (e.g.. Rop has a midpoint temperature of denaturation. 
T m . of about 71°C, Steif et ah. Biochemistry 32, 3867-3876 ( 1 993 )) are also 
preferred peptide anchors. Peptides or proteins having highly stable tertiary 
motifs, such as a four-helix bundle topology, are particularly preferred. 

Alternatively, the stabilizing group can constitute one or more prolines 

25 (Pro). Preferably, a proline dipeptide (Pro-Pro) is used as a stabilizing group, 
however additional prolines may be included. The encoded proline* s) are 
typically naturally occurring amino acids, however if and to the extent a proline 
derivative, for example a hydroxyproline or a methyl- or ethyl- proline 
derivative, can be encoded or otherwise incorporated into the peptide, those 

30 proline derivatives are also useful as stabilizing groups. 

At the N-terminus of the peptide, the stabilizing group can alternatively 
include an oligopeptide having the sequence Xaa-Pro n w herein Xaa is any 
amino acid ..a m is gre _i than 0. Preferably, m is about 1 to about 5- 
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preferably m ^ 2 or ?. more preferably, m 2. Like\\i>e. at the C-terminus of 
the peptide, the stabilizing group can alternau\ el\ include an oligopeptide 
having the sequence -Pro rn -Xaa. wherein Xaa is an\ ammo acid, and m is greater 
than 0. Preferably, n is about 1 to about 5: preferably n = 2 or 3. more 
5 preferably, m = 2. in a particularly preferred embodiment of the method of the 
invention, the nucleic acid sequence that encodes the randomized peptide in the 
expression vector is modified to encode each of a fir-t stabilizing group 
comprising the N-terminus of the peptide, the first stabilizing group being 
selected from the group consisting of small stable protein. Pro-. Pro-Pro-. Xaa- 

10 Pro- and Xaa-Pro-Pro-. and a second stabilizing group comprising the C- 

terminus of the peptide, the second stabilizing group being selected from the 
group consisting of a small stable protein. -Pro. -Pro-Pro, Pro-Xaa and Pro-Pro- 
Xaa. The resulting peptide has enhanced stability in the intracellular 
environment relative to a peptide lacking the terminal stabilizing groups. 

15 In another preferred embodiment of the screening method of the 

invention, the expression vector encodes a four-helix bundle protein fused, at 
either the C-terminus or the N-terminus. to the randomized peptide. Preferably, 
the four-helix bundle protein is E, coli Rop protein or a homolog thereof. The 
non-fused terminus of the randomized peptide can. but need not, comprise a 

20 stabilizing group. The resulting fusion protein is predicted to be more stable 

than the randomized peptide itself in the host intracellular environment. Where 
the four-helix bundle protein is fused to the N-terminus. the randomized peptide 
can optionally be further stabilized by engineering one or more prolines, with or 
without a following undefined amino acid (e.g.. -Pro. -Pro-Pro. -Pro-Xaa. -Pro- 

25 Pro-Xaa. etc. ) at the C-terminus of the peptide sequence; likewise, when the 

four-helix bundle protein is fused to the C-terminus. the randomized peptide can 
be further stabilized by engineering one or more prolines, with or without a 
preceding undefined amino acid {e.g.. Pro-. Pro-Pro-. Xaa-Pro-. Xaa-Pro-Pro-. 
etc.) at the N-terminus of the peptide sequence. 

30 In yet another embodiment of the screening method of the invention, the 

putative bioactive peptide is stabilized by enem;.cring into the peptide a 
stabilizing motif such as an ox-he!! a motif or an opposite charge ending motif. 
Chemic 11 .nthesis o f oligonucleotide according to the scheme 
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j(CAu )A( I 'CAG)] > iclJs aii oligonucleotide encoding a peptide consisting of a 
random mixture of the hydrophilic amino acids His. Gin. Asn. L> s. Asp. and Glu 
(see Table 14). hxcept for Asp. these ammo acids are most often associated with 
cx-helical secondary structural motifs; the resulting oligonucleotides are thus biased 
5 in favor of oligonucleotides that encode peptides that are likely to form u-helices in 
solution. Alternatively, the putative bioactive peptide is stabilized by thinking a 
randomized region with a region of uniform charge (e.g.. positive charge) on one 
end and a region of opposite charge (e.g.. negative) on the other end. to form an 
opposite charge ending motif. To this end. the nucleic acid sequence that encodes 

K) the randomized peptide in the expression vector or the expression vector itself is 
preferably modified to encode a plurality of sequential uniformly charged amino 
acids comprising the N-terminus of the peptide, and a plurality of sequential 
oppositely charged amino acids comprising the C-terminus of the peptide. The 
positive charges are supplied by a plurality of positively charged amino acids 

1 5 consisting of lysine, histidine. arginine or a combination thereof; and the negative 
charges are supplied by a plurality of negatively charged amino acids consisting of 
aspartate, glutamate or a combination thereof. It is expected that such a peptide 
will be stabilized by the ionic interaction of the two oppositely charges ends. 
Preferably, the putative bioactive peptide contains at least three charged amino 

20 acids at each end. More preferably, it contains at least four charged amino acids at 
each end. In a particularly preferred embodiment, the larger acidic amino acid 
glutamate is paired with the smaller basic amino acid lysine, and the smaller acidic 
amino acid aspartate is paired with the larger basic amino acid arginine. 

It is to be understood that novel bioactive peptides identified using the 

25 method for identification of bioactive peptides described herein are also included 
in the present invention. 

The present invention further provides a bioactive peptide containing one 
or more structural features or motifs selected to enhance the stability of the 
bioactive peptide in an intracellular environment. During development and 

30 testing of the intracellular screening method of the present inv ention, it was 
surprisingly discovered that sev eral bio-v;;ice peptides identified from the 
randomized peptide librarv shared particular structural features. For example, a 
di^ jjortionatc 1 * .agh number of bioactiv e peptides identified ".sing the 
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intracellular screening method contained one or more proline residues at or near 
a peptide terminus. A disproportionate number also contained sequences 
predicted, using structure prediction algorithms well-known in the art, to form 
secondary structures such as a helices or f> sheets; or a hydrophobic membrane 
5 spanning domain. Bioactive fusion proteins comprising the randomized peptide 
sequence fused to the Rop protein, due to a deletion event in the expression 
vector, were also identified. 

Accordingly, the invention provides a bioactive peptide having a 
stabilizing group at its Ni-terminus, its C-terminus. or at both termini. In a 

10 bioactive peptide stabilized at only one terminus (i.e., at either the N- or the C- 
terminus) the stabilizing group is preferably either a four-helix bundle protein, 
such as Rop protein, a proline (Pro), or a proline dipeptide (Pro-Pro). It should 
be understood that in any synthetic peptide having a stabilizing group that 
includes one or more prolines according to the present invention, the proline is 

1 5 preferably a naturally occurring amino acid; alternatively, it can be a synthetic 
derivative of proline, for example a hydroxyproline or a methyl- or ethyl- proline 
derivative. Accordingly, where the abbreviation "Pro" is used herein in 
connection with a stabilizing group that is part of a synthetic peptide, it is meant 
to include proline derivatives in addition to a naturally occurring proline. 

20 A peptide stabilized at both termini includes a first stabilizing group 

comprising the N-terminus, and a second stabilizing group stabilizing the C- 
terminus, where the first and second stabilizing groups are as defined previously 
in connection with the method for identifying bioactive peptides. The stabilizing 
group is covalently attached to the peptide. The bioactive peptide of the 

25 invention includes a bioactive peptide that has been detectably labeled. 

derivatized. or modified in any manner desired prior to use, provided it contains 
one or more terminal stabilizing groups as provided herein. In one preferred 
embodiment of the bioactive peptide of the invention, the first stabilizing group, 
comprising the N-terminus. is Xaa-Pro-Pro-. Xaa-Pro-. Pro- or Pro-Pro-: and 

30 second stabilizing group, comprising the C-terminus. is Pro-Pro-Xaa. -Pro-Xaa. - 
Pro or -Pro-Pro; preferably -Pro-Pro. In another preferred embodiment, the first 
(N-terminal) stabilizing e r Oup is a small stable protein, preferably a four-helix 
bi ir " '.e protein c ,i as Rop protein; and the second (C-termina' x stabilizing 
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group is Pro-Pro-Xaa, -Pro-Xaa. -Pro or -Pro-Pro; preferably -Pro-Pro. in yet 
another preferred embodiment, the second iC-terminal ) stabilizing group is a 
small stable protein, preferably a tour-helix bundle protein such as Rop protein, 
and the first (N-terminal) stabilizing group is Pro-. Pro-Pro-. Xaa-Pro- or Xaa- 
5 Pro -Pro-. 

1 he invention further provides a peptide stabilized by flanking the amino 
acid sequence of a bioactive peptide with an opposite charge ending motif, as 
described herein. Preferably, the resulting stabilized peptide retains at least a 
portion of the biological activity of the bioactive protein. The stabilized peptide 

10 includes a peptide that has been detectably labeled, derivatized. or modified in 
any manner desired prior to use. 

It should be understood that any bioactive peptide, without limitation, 
can be stabilized according to the invention by attaching a stabilizing group to 
either or both of the N- and C-termini. or by attaching oppositely charged groups 

1 5 to the N- and C-termini to form an opposite charge ending motif. Included in the 
present invention are any and various antimicrobial peptides, inhibitory peptides, 
therapeutic peptide drugs, and the like. as. for example and without limitation, 
those listed in Tables 1 and 2. that have been modified at one or both peptide 
termini to include a stabilizing group, for example a four-helix bundle protein 

20 such as Rop protein, proline (Pro-), a proline-proline dipeptide (Pro-Pro-), an 
Xaa-Pro- dipeptide, or an Xaa-Pro-Pro-tripeptide at the N-terminus. and/or a 
four-helix bundle protein such as Rop protein, proline (-Pro), or a proline-proline 
dipeptide (-Pro-Pro), a Pro-Xaa dipeptide. or a Pro-Pro-X tripeptide at the C- 
terminus; or that have been modified to contain an opposite charge ending motif 

25 according to the invention. In this aspect the invention is exemplified by 

peptides such as Pro-Pro-Asp- Aru-Yal-T\ r-Ile-His-Pro-Phe-His-lle-Pro-Pro 
(SKQ ID NO: 3) and Glu-Asp-Glu-Asp-Asp-Arg-Val-Tyr-Ile-His-Pro-Phe-His- 
Ile-Arg-Lys-Arg-Lys (SI:Q ID NO: 4). wherein the middle nine amino acids (- 
Asp-Aru-Val-Tyr-IIe-His-Pro-Phe-His-Ile-: SPQ ID NO: 5) constitute the 

30 sequence of angiotensin. 

Modification of a bioactive peptide to yield a stabilized bioacti\e peptide 
according to the invention can be achieved by standard techniques well-known 
. me arts o f Pieties and peptide synthesis. For example, where the peptide is 
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svnthesized iic novo, as in solid state peptide synthesis, one or more prolines can 
he added at the beginning and the end of the peptide chain during the synthetic 
reaction. In recombinant synthesis, tor example as described in hxample III 
herein, one or more codons encoding proline can be inserted into the peptide 
5 coding sequence at the beginning and or the end of the sequence, as desired. 

Preferably, codons encoding N-terminal prolines are inserted after (i.e.. 3* to) the 
initiation site ATG (which encodes for methionine). Analogous techniques are 
used to synthesize bioactive peptides having an opposite charge ending motif. 
When a known bioactive peptide is modified to yield a stabilized bioactive 
10 peptide according to the invention, the unmodified peptide can conveniently be 
used as a control in a protease- or peptidase-resistance assay as described 
hereinabove to confirm, if desired, that the modified peptide exhibits increased 
stability. 

The present invention also provides a clcavable polypeptide comprising a 

15 stabilized, bioactive peptide either immediately preceded by (i.e.. adjacent to the 
N-terminus of the bioactive peptide) a cleavage site, or immediately followed by 
(i.e., adjacent to the C-terminus of the bioactive peptide) a cleavage site. Thus, a 
bioactive peptide as contemplated by the invention can be part of a cleavable 
polypeptide. The cleavable polypeptide is cleavable. either chemically, as with 

20 cyanogen bromide, or enzymatically. to yield the bioactive peptide. The 

resulting bioactive peptide either includes a first stabilizing group comprising its 
N-terminus and a second stabilizing group comprising its C-terminus, or it 
includes an opposite charge ending motif, both as described hereinabove. The 
cleavage site immediately precedes the N-terminal stabilizing group or 

25 immediately follow s the C-terminal stabilizing group. In the case of a bioactive 
peptide having an opposite charge ending motif, the cleavage site immediately 
precedes the first charged region or immediately follows the second charged 
region. The cleavage site makes it possible to administer a bioactive peptide in a 
form that could allow intracellular targeting and or activation. 

30 Alternatively, a bioactive peptide of the invention can be fused to a 

noncleavable N-terminal or C-term:nai targeting sequence wherein the targeting 
sequence allows tarred delivery of the bioactive peptide, e.g.. intracellular 
targeting r- .issue-specific targeting of the bioactive peptide. In one 
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embodiment of tins aspect of the in\ention. live free terminus of the bioaetixe 
peptide comprises a stabilizing group as described hereinabove in connection 
with the screening method for identifying bioucme peptides, tor example one or 
more prolines. The targeting sequence forming the other peptide terminus can. 
5 but need not. contain a small stable protein such as Rop or one or more prolines 
comprising its terminus, as long as the targeting function of the targeting 
sequence is preserved. In another embodiment of this aspect of the invention, 
the bioactive peptide comprises a charge ending motif as described hereinabove, 
w herein one charged region occupies the free terminus of the bioactive peptide. 

10 and the other charged region is disposed between the targeting sequence and the 
active sequence of the bioactive peptide. 

The invention further includes a method for using an antimicrobial 
peptide that includes covalently linking a stabilizing group, as described 
hereinabove, to the N-terminus, the C-terminus. or to both termini, to yield a 

15 stabilized antimicrobial peptide, then contacting a microbe with the stabilized 

antimicrobial peptide. Alternatively, the stabilized antimicrobial peptide used in 
this aspect of the invention is made by covalently linking oppositely charged 
regions, as described hereinabove, to each end of the antimicrobial peptide to 
form an opposite charge ending motif. An antimicrobial peptide is to be broadly 

20 understood as including any bioactive peptide that adversely affects a microbe 
such as a bacterium, virus, protozoan, or the like, as described in more detail 
hereinabov e. An example of an antimicrobial peptide is an inhibitory peptide 
that inhibits the growth of a microbe. When the antimicrobial peptide is 
covalently linked to a stabilizing group at only one peptide terminus, any of the 

25 stabilizing groups described hereinabove can be utilized. When the 

antimicrobial peptide is covalently linked to a stabilizing group at both peptide 
termini, the method includes covalently linking a first stabilizing group to the N- 
terminus of the antimicrobial peptide and a second stabilizing group to the C- 
terminus of the antimicrobial peptide, where the first and second stabilizing 

30 groups are as defined previously in connection with the method for identifying 
bioactive peptides. In a preferred embodiment of the method for using an 
antimicrobial peptide, one or more prolines, more preferably a proline-proline 
dipep t: ' is attached to at least one. preferably both, termini of the antimicro^" 
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peptide. Alternatively, or in addition, an Xaa-Pro- or an Xaa-Pro-Pro sequence 
can be attached to the N-tenninus of a microbial peptide, and or a Pro-Xaa or a 
Pro-Pro-Xaa sequence can be attached to the C-terminus. to yield a stabilized 
antimicrobial peptide. 
5 The antimicrobial peptide thus modified in accordance with the invention 

has enhanced stability in the intracellular environment relative to an unmodified 
antimicrobial peptide. As noted earlier, the unmodified peptide can conveniently 
be used as a control in a protease- or peptidase-resistance assay as described 
hereinabove to confirm, if desired, that the modified peptide exhibits increased 

10 stability. Further, the antimicrobial activity of the antimicrobial peptide is 
preferably preserved or enhanced in the modified antimicrobial peptide; 
modifications that reduce or eliminate the antimicrobial activity of the 
antimicrobial peptide are easily detected and are to be avoided. 

The invention further provides a method for inhibiting the growth of a 

1 5 microbe comprising contacting the microbe with a stabilized inhibitory peptide. 
In one embodiment of this aspect of the invention, the stabilized inhibitory 
peptide has a stabilizing group at the N-terminus, the C-terminus. or to both. 
Preferably, the inhibitory peptide has a first stabilizing group comprising the N- 
terminus of the inhibitory peptide, and a second stabilizing group comprising the 

20 C-terminus of the inhibitory peptide; the first and second stabilizing groups are 
as defined previously in connection with the method for identifying bioactive 
peptides. In another embodiment of this aspect of the invention, the inhibitor) 
peptide is stabilized by the addition of oppositely charged regions to each end to 
form an opposite charge ending motif, as described hereinabove. 

25 Also included in the present invention is a method for treating a patient 

having a condition treatable with a peptide drug, comprising administering to the 
patient a peptide drug that has been stabilized as described herein. Peptide drugs 
for use in therapeutic treatments are well-known (see Table 1 ). However, they 
are often easily degraded in biological systems, which affects their efficacy. In 

30 on embodiment of the present method, the patient is treated with a stabilized 
drug comprising the peptide drug of choice and a stabilizing group attached at 
either the N-te: minus, the C-terminus of. or at both termini of the peptide drug. 
In - .mer embodiment of the present method, the patient is treated with a 
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stabilized drug comprising the peptide drug of choice and stabilized b\ 
attachment of opposite!) charged regions to both termini of the peptide drug. 
Because the peptide drug is thereby stabilized against proteolytic degradation, 
greater amounts of the drug should reach the intended target in the patient. 
5 in embodiments of the method invok ing administration of a peptide drug 

that is covalently linked to a stabilizing group at only one peptide terminus, the 
stabilizing group is preferably a four-helix bundle protein such as Rop protein, 
provided that attachment of the four-helix bundle protein to the peptide terminus 
preserves a sufficient amount of efficacy for the drug. It is to be nonetheless 

1 0 understood that group or groups used to stabilize the peptide drug are as defined 
hereinabov e, without limitation. In embodiments involving administration of a 
peptide drug covalently linked to a stabilizing group at both peptide termini, the 
peptide drug includes a first stabilizing group comprising the N-terminus of the 
peptide drug and a second stabilizing group comprising the C-terminus of the 

1 5 peptide drug. Thus, in another preferred embodiment of the treatment method of 
the invention, the stabilized peptide drug comprises one or more prolines, more 
preferably a proline-proline dipeptide. attached to one or both termini of the 
peptide drug. For example, the peptide drug can be stabilized by covalent 
attachment of a Rop protein at one terminus, and by a proline or proline 

20 dipeptide at the other terminus; in another preferred embodiment, the peptide 
drug can be stabilized by proline dipeptides at each of the N-terminus and C- 
terminus. Alternatively, or in addition, the stabilized peptide drug used in the 
treatment method comprises an Xaa-Pro- or an Xaa-Pro-Pro- sequence at the N- 
terminus of the peptide drug, and/or a -Pro-Xaa or a -Pro-Pro-Xaa sequence at 

25 the C-terminus. Optionally, prior to administering the stabilized peptide drug, 
the treatment method can include a step comprising covalently linking a 
stabilizing group to one or both termini of the peptide drug to yield the 
stabilized peptide drug. 

If desired, the unmodified peptide drug can conveniently be used as a 

30 control in a protease- or peptidase-resistance assay as described hereinabove to 
confirm that the stabilized peptide drug exhibits increased stability. Further, the 
therapeutic ^"iicacy of the peptide drug is preferably preserved or enhanced in 
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the stabilized peptide drug; modifications that reduce or eliminate the therapeutic 
efficacv of the peptide drug are easily detected and are to be av oided. 

The present invention further includes a fusion protein comprising a foui- 
helix bundle protein, preferably Rop protein, and a polypeptide. Preferably the 
5 polvpeptide is bioactive: more preferably it is a bioactive peptide. The fusion 
protein of the invention can be used in any convenient expression vector know n 
in the art for expression or overcxpression of a peptide or protein of interest. 
Optionally, a cleavage site is present between four helix bundle protein and the 
polv peptide to allow cleavage, isolation and purification of the pol vpeptide. In 

10 one embodiment of the fusion protein, the four helix bundle protein is covalently 
linked at its C-terminus to the N-tenninus of the pol vpeptide; in an alternative 
embodiment, the four helix bundle protein is covalently linked at its N'-terminus 
to the C-terminus of the polypeptide. Fusion proteins of the invention, and 
expression vectors comprising nucleic acid sequences encoding fusion proteins 

15 wherein the nucleic acid sequences are operably linked to a regulatory control 
element such as a promoter, are useful for producing or overproducing any 
peptide or protein of interest. 

EXAMPLES 

20 

The present invention is illustrated by the following examples. It is to be 
understood that the particular examples, materials, amounts, and procedures are 
to be interpreted broadly in accordance with the scope and spirit of the invention 
as set forth herein. 
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Example I 

Construction and characterization of a highly regulable expression vector, 
pLACll, and its multipurpose derivatives, pL AC22 and pLAC33 

A number of different expression vectors have been developed over the 
years to facilitate the production of proteins in E. coii and related bacteria. Most 
of the routinely employed expression vectors rely on lac control in order to 
overproduce a gene of choice. While these vectors allow for overexprcssion of 
the gene product of interest, they are leaky due to changes that have been 
introduced into the lac control region and gene expression can never be shut off 
under repressed conditions, as described in more detail below. Numerous 
researchers have noticed this problem with the more popular expression vectors 
pKK223-3 (G. Posfai et al. Gene. 50: 63-67 (1986); N. Scrutton et ah, Biochem 
J. 245: 875-880 (1987)). pKK233-2 (P. Beremand et al.. Arch Biochem 
Biophys. 256: 90-100(1987); K. Ooki et al.. Biochemie. 76: 398-403 (1994)). 
pTrc99A (S. Ghosh, Protein Expr. Purif. 10: 100-106 {1997): J. Ranie et al., 
Mol. Biochem. Parasitol. 61: 159-169 (1993)). as well as the pET series (M. 
Eren et al.. J. Biol. Chem. 264: 14874-14879 (1989); G. Godson, Gene 100: 59- 
64 (1991)). 

The expression vector described in this example. pLACl 1. was designed 
to be more regulable and thus more tightly repressible when grown under 
repressed conditions. This allows better regulation of cloned genes in order to 
conduct physiological experiments. pLACl 1 can be used to conduct 
physiologically relevant studies in which the cloned gene is expressed at levels 
equal to that obtainable from the chromosomal copy of the gene in question. 
The expression vectors described here were designed utilizing the wild-type lac 
promoter/operator in order to accomplish this purpose and include all of the lac 
control region, without modification, that is contained betw een the start of the 
03 auxiliary operator through the end of the Ol operator. As with all lac based 
vectors, the pLACl i expression vector described herein can be turned on or off 
by t 1 _ presence or absence of the gratuitous inducer IPTG. In experiments in 
which a bacterial cell contained both a nHl allele in the chromo cr ...c and a 
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second copy of the wild-type allele on pLACl i. cells grown under repressed 
conditions exhibited the null phenotype while cells grown under induced 
conditions exhibited the wild-type phenotype. Thus the pLACl 1 vector truly 
allows for the gene of interest to be grown under either completely repressed or 
5 fully induced conditions. Two multipurpose derivatives of pLACl 1, pLAC22 
and pLAC33. were also constructed to fulfill different experimental needs. 

The vectors pLACl 1, pLAC22 and pLAC33 were deposited with the 
American Type Culture Collection (ATCC), 10801 University Blvd.. Manassas. 
YA. 201 10-2209. USA. on February 16. 1999. and assigned ATCC deposit 

10 numbers ATCC 207108, ATCC 2071 10 and ATCC 207109. respectively. It is 
nonetheless to be understood that the written description herein is considered 
sufficient to enable one skilled in the art to fully practice the present invention. 
Moreover, the deposited embodiment is intended as a single illustration of one 
aspect of the invention and is not to be construed as limiting the scope of the 

15 claims in anv way. 
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MATERIALS AM) METHODS 

Media. Minimal M9 media (0 g disodium phosphate. 3 g pota>>ium phosphate. 
1 g ammonium chloride. 0.5 g sodium chloride, distilled water to 1L: autoclave: 
add 1 mL magnesium sulfate { 1 M ) and 0.1 ml. calcium chloride ( 1M): a sugar 
added to a final concentration of 0.2° o: vitamins and amino acids as required for 
non-prototrophic strains) and rich LB media ( 10 g tryptone. 5 g yeast extract. 10 
g sodium chloride, distilled water to 1L: autoclave) were prepared as described 
by Miller (J. Miller. "Experiments in molecular genetics" Cold Spring Harbor 
Laboratory, Cold Spring Harbor. N.Y. (1972). The antibiotics ampicillin. 
kanamycin. streptomycin, and tetracycline (Sigma Chemical Company. St. 
Louis. MO) were used in rich media at a final concentration of 100. 40. 200. and 
20 ug/ml, respectively. When used in minimal media, tetracycline was added at 
a final concentration of 10 ug/ml. 5-bromo-4-chloro-3-indoyl [}-d- 
galactopyranoside fXgal) was added to media at a final concentration of 40 
ug/ml and unless otherwise noted. IPTG was added to media at a final 
concentration of 1 mM. 

Chemicals and Reagents. When amplified DNA was used to construct the 
20 plasmids that were generated in this study, the PCR reaction was carried out 

using native Pfu polymerase from Stratagene (Cat. No. 600135). Xgal and IPTG 
were purchased from Diagnostic Chemicals Limited. 

Bacterial Strains and Plasmids. Bacterial strains and plasmids are listed in 
25 Table 4. To construct ALS225. ALS224 was mated with ALS2 1 o and 

streptomycin resistant, blue recombinants were selected on a Rich LB plate that 
contained streptomycin. Xgal. and IPTG. To construct ALS226. ALS224 was 
mated with ALS217 and streptomycin resistant, kanomycin resistant 
recombinants were selected on a Rich LB plate that contained streptomycin and 
30 kanamycin. To construct ALS515. ALS514 was mated with ALS216 and 

streptomveir. resistant, blue recombinants were selected on a Rich LB plate that 
contained streptomycin. Xgal. and IPTG. To construct ALS52". ALS524 was 
mated with ALS224 and streptomv^in resistant, tetracyclic .w-sistant 
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recombinants were selected on a Rich LB plate that contained streptomycin and 
tetracycline. To construct ALS535. ALS533 was mated with ALS49S and 
tetracycline resistant recombinants were selected on a Minimal M9 Glucose 
plate that contained tetracycline, leucine and thiamine tB] ) (Sigma Chemical 
5 Company). To construct ALS533. a PI iysate prepared from E. coli strain 

K5076 (II. Miller et al.. Cell 20: 711-719 (1980)) was used to transduce ALS224 
and tetracycline resistant transduetants were selected. 
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Construction of the pLACT L pLAC22, and pLAC 33 expression vectors. '1 o 

construct pLACl 1. primers "1 and -2 {sec Tabic 5) were u.^ed to PCR amplify a 
952 base pair (bp fragment from the plasmid pBH2(> which contains the wild- 
type Lie opcron. Primer ~-2 introduced two different base pair mutations into the 
5 seven base spacer region between the Shine Dalgarno site and the ATG start site 
of the LieZ which converted it from AACAGCT to AACjATCT thus placing a 
Bgl II site in between the Shine Dalgarno and the start eodon of the LieZ gene. 
The resulting fragment was gel isolated, digested with Pst I and EcoR I. and then 
ligated into the 3614 bp fragment from the plasmid pBR322AAvaI which had 

1 0 been digested with the same two restriction enzymes. To construct 

pBR322AAvaI. pBr322 was digested with Aval, filled-in using Klenow. and 
then religated. To construct pLAC22. a 12 c )] bp Kco I. licoR 1 fragment was gel 
isolated from pLAC21 and ligated to a 4361 bp Nco I. EcoR I fragment which 
was gel isolated from pBR322/NcoI. To construct pLAC2L primers #2 and #3 

1 5 (see Table 5) were used to PCR amplify a 1310 bp fragment from the plasmid 

pMS421 which contains the wild-type lac operon as well as the lacM repressor. 
The resulting fragment was gel isolated, digested with EcoR 1. and then ligated 
into pBR322 which had also been digested with EcoR 1. To construct 
pBR322/Nco I, primers #4 and #5 (see Table 5) were used to PCR amplify a 788 

20 bp fragment from the plasmid pBR322. The resulting fragment was gel isolated, 
digested with Pst 1 and EcoR I. and then ligated into the 3606 bp fragment from 
the plasmid pBR322 which had been digested with the same two restriction 
enzymes. The pBR322/Nco 1 vector also contains added Rpn 1 and Sma I sites 
in addition to the new Nco I site. To construct pLAC33, a 2778 bp fragment was 

25 gel isolated from pLAC12 which had been digested with BsaB 1 and Bsa 1 and 
ligated to a 960 bp fragment from pUC8 which had been digested with Afl III. 
filled-in with Klenow. and then digested with Bsa I. To construct pLAC12. a 
1310 bp Pst I. Bamll 1 fragment was gel isolated from pLACl 1 and ligated to a 
3232 bp Pst I. BamI 1 1 fragment which was gel isolated from pBR322. 



WO 00/22112 



PCT/US99/23731 



Tahlc 5. Primers employed to PCR amplify DN A fragments that were 

used in the construction of the v arious plasmids described in 
Example 1 



? pLACll and pLAC2 2 

1 (for 1 ' aTT...GwC..rtTT...G^X.^a..S:3C..AT (SE-; ID NO: 6) 

2 -.rev* ATT GAA TTC ATA AG A TOT XT.C.CT3 TGX..CvkA..ATT_.GXX_. ATG 
C.G2 ( SEQ ID ICO : 7 ) 

3 (for, ATT £AA XXC A££ ATG GAG AC G ATA. .GAA 7.GG...TGG.. AAA.. A 
10 ( SEQ ID NO: 8 ) 

pER2 2 2/Nco I 

4 (for; GTT. GIT GCC ATT GOT GCA G (SEC " NO : 5 ) 

5 (rev; TGT ATG AAT TCC CGG GTA CCA X22 TTG. . AAG . .AGG. . AAA . .GGG 
1 5 C.GT. .G ( SEQ ID NO : 1 0 } 

Egl II - lacG - Hind III 

6 (for) TAG TAT ftGA TCT AT.G. . AGG. . AT.G. . ATG . AGG . .GAT. . .T.CA..CXG ( SEQ 
ID NO: 11) 

20 7 (rev) TAG ATA AAG CTT GGG. ..CTG_. CCC .GGX. . TAT. .TAT... TAT. ..TXT {SEQ 

ID NO: 12 ) 

Pst I - lacZ - Hind III 

8 (for) TAT CAT CTG ££G AGG AAA CAG CTA. . .TG A. . .CCA . . XGA . .TTA. . .CGG 
25 A7.T...CAC..XG ( SEQ ID NO : 1 3 ) 

9 (rev) TAG ATA CTC GAG CAG GA£ AGG TT G . GGG.. .TGG . . GGG. ..GTT.. A7T 
AT.T..ATX..TX (SEQ ID NO: 14) 

Bam! I 1 - lacZ - Hind III (also uses primer #9! 

30 10 (for) TAT CAT GGA TCC AGG AAA CAG CTA . .XGA...CCA..TGA..TXA...C.GG 

AT.T...CAG..XG ( SEQ ID NO : 15 ) 

Bgl II - recA - Hind III 

11 (for) TAG TAT AGA TCT ATG . .GGX. . AT.G . .GAG. . GAA . .AAG. . AAA . .GAG (SEQ 
35 ID NO : 16 ) 

12 (rev) ATA TAT AAG QTT XX A . AAA. . AT.G. . . TTG. . .GXX. . AGT . . X.T.C. . .TG G. . .TAG 
G (SEQ ID NO : 17 ) 

BamK 1 - xylE - EccR I 

40 12 (for) TAG TAT X21 AXG. . AA. G . . AAA, . . G GT. . . G X A . . ATG . . GGA . . C C ( SEQ 

ID NO : 18 ; 

14 i rev; ATT AGT GAA TXG GGA ..GAA. . TGT . CTG. CAA..TAA. GT.G...GX (SEQ 



ID NO : 1 9 
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In Table 5 the regions of" the primers that are homologous to the DNA target 
template are indicated with a dotted underline, while the relevant restriction sites 
are indicated with a solid underline. All primers are listed in the 5' —> 3' 
orientation. 

5 

Compilation of the DN A sequences for the pLACll, pLAC22, and pLAC33 
expression vectors. All of the DNA that is contained in the pLACl 1. pLAC22. 
and. pLAC33 vectors has been sequenced. 

The sequence for the pLACl 1 vector which is 4547 bp can be compiled 

10 as follows: bp 1-15 are AGATCTTATGAATTC (SEQ ID NO:20) from primer 
H2 (Table 5): bp 16-1434 are bp 4-1422 from pBR322 (GenBank Accession p 
J01749); bp 1435-1442 are TCGGTCGG. caused by filling in the Ava I site in 
pBR322AAvaI; bp 1443-4375 is bp 1427-4359 from pBR322 (GenBank 
Accession # J01749): and bp 4376-4547 are bp 1 106-1277 from the wild-type E. 

15 coli lac operon (GenBank Accession # JO 1636). 

The sequence for the pLAC22 vector which is 5652 bp can be compiled 
as follows: bp 1-15 are AGATCTTATGAATTC (SEQ ID NO:21 ) from primer 
#2 (Table 5); bp 16-4370 are bp 4-4358 from pBR322 (GenBank Accession # 
J0174 c >); bp 4371-4376 is CCATGG which is the Nco I site from pBR322/Nco I; 

20 and bp 4377-5652 are bp 2-1277 from the wild-type E. coli lac operon 

(GenBank Accession # J01636K except that bp #4391 of the pLAC22 sequence 
or bp#16 from the wild-type E. coli lac operon sequence has been changed from 
a M C M to a "T" to reflect the presence of the lacl°i mutation (J. Brosius et ah. 
Proc. Natl. Acad. Sci. USA. 81:6929-6933 (1984)). 

25 The sequence for the pLAC33 vector which is 3742 bp can be compiled 

as follows: bp 1-15 is AGATCTTATGAATTC (SEQ ID NO:22) from primer 
#2 (Table 5 ); bp 16-1684 are bp 4-1 672 from pBR322 (GenBank Accession U 
JO 1749); bp 1685-2638 are bp 786-1739 from pUC8 (GenBank Accession s 
L09132): bp 2639-3570 are bp 3428-4359 from pBR322 (GenBank Accession ~ 

30 J01749); and bp 3571-3742 are bp 1 1 06-1 277 from the wild-type E. coli lac 

operon (GenBank Accession ~ JO 1636). In the maps for these vector^ the ori is 
identified as per Baibas (P. Balbas et ah, Geiv\ 50:3-40 M9son while the lacVQ 
is indicatH starting with the 03 p*' .iary oper.^* . and ending with the Ol 
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operator as per Miiller-1 Ii!l < B. Muller-Mill. 1 he A-,v < >peron: A Short Mistorv of 
a Genetic Paradigm . Waller de Gruyter. Berlin. Germam ( 1^96)). 

Construction of the pLACl pLAC22-, pLAC33-, pKK223-3-, pKK233-2-, 
5 pTrc99A-, and pET-21(+)- lacZ constructs. J o construct pLACl \ -iacZ. 

pEAC22-/acZ. and pLAC33-/at*Z, primers ~6 and ^7 (see Table 5) were used to 
PCR amplify a 31 15 bp fragment from the plasmid pier? w hich contains the 
wild-type lacZ gene. The resulting fragment was gel isolated, digested with Bgl 
II and Hind III. and then ligated into the pLACl 1. pLAC22. or pLAC33 vectors 

10 which had been digested with the same two restriction enzymes. To construct 
pKK223-3-/c;cZand pKR233-2-/acZ. primers £8 and #9 (see Table 5) were used 
to PCR amplify a 3137 bp fragment from the plasmid pTer7. The resulting 
fragment was gel isolated, digested with Pst 1 and Mind III. and then ligated into 
the pKK223-3 or pKK233-2 vectors which had been digested with the same two 

\5 restriction enzymes. To construct pTrc99A-/acZ and pET-21 { + ')-IacZ, primers 
#9 and #10 (see Table 5) were used to PCR amplify a 3137 bp fragment from the 
plasmid pTer7. The resulting fragment was gel isolated, digested w ith BamH I 
and Hind III, and then ligated into the pTrc99A or pET-21(+) vectors which had 
been digested with the same two restriction enzymes. 

20 

Construction of the pLACl 1- recA and xylE constructs. To construct 
pLACl \ -recA. primers #1 1 and #12 (see Table 5) were used to PCR amplify a 
1085 bp fragment from the plasmid pGE226 which contains the wild-type recA 
gene. The resulting fragment was gel isolated, digested with Bgl II and Mind III, 

15 and then ligated into the pLACl 1 vector which had been digested with the same 
two restriction enzymes. To construct pLACl l-vv/A*, primers £13 and #14 (see 
Table 5) were used to PCR amplify a 979 bp fragment from the plasmid pXE60 
which contains the wild-type Pseudomonas puiida xylE gene isolated from the 
TOL p\V\VO plasmid. The resulting fragment was gel isolated, digested with 

>0 Bgl II and EcoR I. and then ligated into the pLACl 1 vector which had been 
digested with the same two restriction enzymes. 
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Assays. |)-galactosidase assa\ s were performed as described b> Miller (J. 
Miller. "Experiments in molecular genetics" Cold Spring Harbor Laboratory 
Cold Spring Harbor. N.Y. ( 1972 m. w hile catechol 2.3-dio\ygenase (cat( >2ase) 
assays were performed as described b\ Zukowski. et. al. f M. Zukowski et aL 
5 Proe. Nail. Acad. Sei. U.S.A. 80:1 101-1 105 ( 1983)). 

RESULTS 

Construction and features of pLACIl, pLAC22, and pLAC33. Plasmid 

10 maps that indicate the unique restriction sites, drug resistances, origin of 

replication, and other relevant regions that are contained in pLACl 1, pLAC22. 
and pLAC33 are shown in Figs. 2. 3 and 4. respectively. pLACl 1 was designed 
to be the most tightly regulable of these vectors. It utilizes the ColEl origin of 
replication from pBR322 and Lac I repressor is provided in trans from either an 

1 5 episome or another compatible plasmid. pLAC22 is very similar to pLACl 1 . 
however, it also contains lacH, thus a source of LacI docs not have to be 
provided in trans. pLAC33 is a derivative of pLACl 1 which utilizes the mutated 
ColEl origin of replication from pUC8 (S. Lin-Chao et al., Mol. Micro. 6:3385- 
3393 (1992)) and thus pLAC33's copy number is significantly higher than 

20 pLACl 1 and is comparable to that of other pUC vectors. Because the cloning 
regions of these three vectors are identical, cloned genes can be trivially shuffled 
betw een and among them depending on the expression demands of the 
experiment in question. 

To clone into pLACl 1. pLAC22. or pLAC33. PCR amplification is 

25 performed with primers that are designed to introduce unique restriction sites 
just upstream and downstream of the gene of interest. Usually a Bgl II site is 
introduced immediately in front of the ATG start codon and an EcoR I site is 
introduced immediately following the stop codon. An additional 6 bases is added 
to both ends of the oligonucleotide in order to ensure that complete digestion of 

30 the amplified PCR product will occur. After amplification the double-stranded 
(ds) DNA is restricted with Bgl II and EcoR 1. arC /,oned into the vector which 
has also been restricted with the sam*; .wo enzymes. If the gene of interest 
contains a Bp 1 site, then T? .,nl! I or Bel 1 can be used instead since thev 
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generate overhangs which arc compatible with BliI II. if the gene of interest 
contains an hcoR 1 site, then a site downstream olTieoR 1 in the vector (such as 
1 Imd III ) can he substituted. 

5 Comparison of pLACll, pLAC22. and pLAC 33, to othe r expression 
vectors. In order to demonstrate how regulable the pLAC 1 1 . pL AC22. and 
pLAC33 expression vectors were, the wild-type UicZ gene was cloned into 
pLACl 1. pLAC22. pLAC33. pkk223-3, pKK233-2. plrcWA. and P ET-21(+). 
Constructs which required an extraneous source of Lacl lor their repression were 

10 transformed into ALS225. while constructs which contained a source of Lacl on 
the vector were transformed into ALS224. pET-21{ + ) constructs were 
transformed into BL21 because they require T7 RNA polymerase for their 
expression. Four clones were chosen for each of these seven constructs and p- 
galactosidase assays were performed under repressed and induced conditions. 

15 Rich Amp overnights were diluted 1 to 200 in either Rich Amp Glucose or Rich 
Amp IPTG media and grown until they reached mid-log (OD550 = 0.5). In the 
case of pET-21(+) the pLysE and pLysS plasmids. which make T7 lysozyme and 
thus lower the amount of available T7 polymerase, w ere also transformed into 
each of the constructs. Table 6 shows the results of these studies and also lists 

20 the induction ratio that w as determined for each of the expression vectors. As 
the data clearly indicates, pLACl 1 is the most regulable of these expression 
vectors and its induction ratio is close to that which can be achieved with the 
w ild-type lac operon. T he vector which yielded the lowest level of expression 
under repressed conditions was pLACl K while the vector which yielded the 

25 highest level of expression under induced conditions w as pLAC33. 
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Fable 6: ()-galactosidase levels obtained in different expression \cctoi> 

gro\%n under either repressed or induced conditions 



Vector 


Source 


ff of .Miller I nits Observed 






Repressed 
Conditions 


1 nduced 
C onditions 


r old 
1 nduction 


pLACI 1 


F' 


W 


1 1209 


590X 


pLAC22 


IMasmid 


152 


13315 


88X 


pLAC33 


F' 


322 


23443 


73 X 


pKK223-3 


F' 


92 


1 1037 


120X 


pkk233-2 


F 


85 


10371 


122X 


pTrc99A 


Plasm id 


261 


21381 


S2X 


pET-21(-) 


IMasmid 


292 l > 


16803 


6X 


pET-21(-) / pLysE 


Plasm id 


4085 


19558 


5X 


pET-21( + ) / pLysS 


Plasmid 


1 598 


20268 


I3X 



10 



The average values obtained for the four clones that were tested from each 
vector are listed in the table. Standard deviation is not shown but was less than 
5% in each case. Induction ratios are expressed as the ratio of enzymatic activity 
observed at fully induced conditions versus fully repressed conditions. The 
plasmid pLysh yielded unexpected results; it was expected to cause lower 
amounts of lacZ to be expressed from pET-21( + ) under repressed conditions 
and. instead, higher amounts were observed. As a result, both pLysE and pLysS 
were restriction mapped to make sure that they were correct. 



Demonstrating that pLACll constructs can be tightly regulated. pLACI 1 
15 was designed to provide researchers with an expression vector that could be 

utilized to conduct physiological experiments in which a cloned gene is studied 
under completely repressed conditions w here it is off or partially induced 
conditions where it is expressed at physiologically relevant levels. Figure 5 
demonstrates how a pLACI 1 -lacZ construct can be utilized to mimic 
20 chromosomal ly expressed iarZ ;hut occurs under various physiological 

conditions by vary'^g the amount oflPTG inducer that is added. ALS226 cells 
containi T :_. pi^ACT 1 ~IacZ were grown to mid-log in rich media that contained 
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varying amounts of IPIG and then | >-galactosidase activitv was a>sa\ed. Also 
indicated in the graph are the average (5-galactosidase activities obtained tor 
strains with a single chromosomal copy of the wild-type lucZ gene that were 
grown under different conditions. 
5 To demonstrate just how regulable pLACl l is, the recA gene was cloned 

into the pLACl l vector and transformed into cells which contained a null rccA 
allele in the chromosome. As the results in Table 7 clearly shows, 
recombination can not occur in a host strain which contains a nonfunctional 
RecA protein and thus PI lysates which provide a TnlOdKan transposon can not 

l 0 be used to transduce the strain to Kan R at a high frequency. rccA' cells which 
also contain the pLACl \ -rccA construct can be transduced to Kan R at a high 
frequency when grown under induced conditions but can not be transduced to 
Kan R when grown under repressed conditions. 

15 Table 7: The recombination (-) phenotype of a recA null mutant strain 
can be preserved with a pLACll-recA (wild-type) construct 
under repressed conditions 





Repressed Conditions 


Induced Conditions 


Strain 


Number of Kan^ 


Number of Kan^ 




transductants 


transductants 


ALS225 (recA*) 


173 , 000 


182 , 000 


ALS514 {recA~) 


5 


4 


ALS5 1 5 (recA' pCyi-3-recA) 


4 


174,000 



20 The data presented in T able 7 are the number of Karf* transductants that were 
obtained from the different MCI 061 derivative strains when they were 
transduced with a PI hsate prepared from strain ALS598 which harbored a 
TnlOdKan transposon insertion. Overnights were prepared from each of these 
strains using either rich medium to which glucose was added at a final 

25 concentration of 0.2°r, (repressed conditions) or rich medium to which IPTG was 
added at ~- anal concentration of 1 mM (induced conditions). The overnights 
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were then diluted 1 to 10 into the same medium which contained CaCh added to 
a final concentration of 10 mM and aerated for two hours to make them 
competent for transduction with PI phage. Cells were then 

spectrophotomctrically normalized and aliquots of 5 OD550 cell equivalents in a 
5 volume of approximately 0.1 ml were transduced with 0.1 ml of concentrated PI 
lysate as well as 0.1 ml of PI lysates that had been diluted to 10" 1 , 10"-. or 10°. 
0.2 ml of 0.1 M Sodium Citrate was added to the cell phage mixtures and 0.2 ml 
of the final mixtures were plated onto Rich Kanamycin plates and incubated 
overnight at 37"C. The total number of kan R colonies were then counted. 

10 ALS225 recA + data points were taken from the transductions which used the 10" 
3 diluted phage, while ALS514 recA~ data points were taken from the 
transductions which used the concentrated phage. The data points for ALS515 
recA~ pCyl-3-rccA grown under repressed conditions were taken from the 
transductions which used the concentrated phage, while the data points for 

15 ALS515 rccA' pCyt-3-n?cvl grown under induced conditions were taken from 
the transductions which used the 10° diluted phage. 

Testing various sources of LacI for trans repression of pLACl 1. Because 
pLACl 1 was designed to be used with an extraneous source of LacI repressor, 

20 different episomal or plasmid sources of LacI which are routinely employed by 
researchers were tested. Since one of the LacI sources also contained the lacZ 
gene, a reporter construct other than pL AC 1 1 -lacZ was required and thus a 
pLACl \-xylE construct was engineered. Table 8 shows the results of this study. 
All of the LacI sources that were tested proved to be adequate to repress 

25 expression from pLACl 1 , however, some were better than others. The basal 

level of expression that was observed with F's which provided lacH^ or with the 
plasmid pMS421 which provided lacH at approximately six copies per cell was 
lower than the basal level of expression that w as observed with F's which 
provided lacH all three times that the assay was run. Unfortunately, however, 

30 the xylE gene couiu not be induced as high when lacl^ on a F' or lacH on a 
plasmid was used as the source of Lac repressor. 
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Table 8: Catechol 2,3-dioxygenasc levels obtained for a pLACl \-xylE 

construct when Lac repressor is provided by various sources 

5 



Strain 


Source of Lad 


Catechol 2,3-dioxygenase activity in 
milliunits/mg 


Rep ressed 
Conditions 


Induced 
Conditions 


ALS224 


None 


32.7 


432.8 


ALS535 


VlacH A(lacZ)M15 
proA+B+ TnlO 


. 3 


204.4 


ALS527 


FlacR A(lacZ)M15 
proA+B+ 


.3 


243.3 


ALS227 


pMS421 lacft 


_2 


90.9 


ALS225 




2 


107.4 


ALS226 


riacf\ ] Z::Tn5 }^ A^ 


.2 


85.1 



The wild-type xylE gene was cloned into the pLACl 1 vector and the resulting 
pLACl l-rv/£ construct was then transformed into each of the MCI 061 
derivative strains listed in the table. Rich overnights were diluted 1 to 200 in 

1 0 either Rich Glucose or Rich IPTG media and grown until they reached mid-log 
(OD550 = 0.5). Cell extracts were then prepared and catechol 2,3-dioxygenase 
assays were performed as described by Zukowski. et. al (Proc. Natl. Acad. Sci. 
U.S.A. 80:1 101-1 105 (1983)). The average values obtained in three different 
experiments are listed in the table. Standard deviation is not shown but was less 

15 than 10% in each case. 

DISCUSSION 

Most of the routinely employed expression vectors rely on lac control in 
20 order to overproduce a gene of choice. The lac promoter/operator functions as 1 
uoes due to the interplay of three main components. First, the wild-';. ^ lac -10 
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region (TATGTT) is very weak. c-AMP activated CAP protein is able to bind to 
the CAP site just upstream of the -35 region which stimulates binding of RNA 
polymerase to the weak -10 site. Repression of the lac promoter is observed 
when glucose is the main carbon source because very little c-AMP is present 
5 which results in low amounts of available c-AMP activated CAP protein. When 
poor carbon sources such as lactose or glycerol are used. c-AMP levels rise and 
large amounts of c-AMP activated CAP protein become available. Thus 
induction of the lac promoter can occur. Second, Lac repressor binds to the lac 
operator. Lac repressor can be overcome by allolactose which is a natural 
1 0 byproduct of lactose utilization in the cell, or by the gratuitous inducer, IPTG. 
Third, the lac operator can form stable loop structures which prevents the 
initiation of transcription due to the interaction of the Lac repressor with the lac 
operator (Ol) and one of two auxiliary operators, 02 which is located 
downstream in the coding region of the lacZ gene, or 03 which is located just 
1 5 upstream of the CAP binding site. 

While binding of Lac repressor to the lac operator is the major effector of 
lac regulation, the other two components are not dispensable. However, most of 
the routinely used lac regulable vectors either contain mutations or deletions 
which alter the affect of the other two components. The pKK223-3 (J. Brosius et 
20 al., Proc. Natl. Acad. Sci. USA. 81:6929-6933 (1984)), pKK233-2 (E. Amann et 
al., Gene. 40:183-190 (1985)), pTrc99A (E. Amann et ah. Gene. 69:301-315 
(1988) ), and pET family of vectors (F. Studier, Method Enzymol. 185:60-89 
(1990)) contain only the lac operator (Ol) and lack both the CAP binding site as 
well as the 03 auxiliary operator. pKK223-3, pKK233-2. and P Trc99 use a trp- 
25 lac hybrid promoter that contains the trp -35 region and the /<?cUV5 -10 region 
which contains a strong TATAAT site instead of the weak TATGTT site. The 
pET family of vectors use the strong T7 promoter. Given this information, 
perhaps it is not so surprising researchers have found it is not possible to tightly 
shut off genes that are cloned into these vectors. 
30 The purpose of the studies described in Example I was to design a vector 

which would allow researchers to better regulate their cloned genes in order to 
conduct physiological experiments. The expression vectors described herein 
were designed utilizing the wild-type lac promoter/operator in ordr: 10 
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accomplish this purpose and include all of the lac control region, without 
modification, that is contained betw een the start of the 03 auxiliary operator 
through the end of the Ol operator. As with all lac based vectors, the pLACl 1. 
pLAC22, and pLAC33 expression vectors can be turned on or off by the 
5 presence or absence of the gratuitous inducer IPTG. 

Because the new vector, pLACl 1. relies on the wild-type lac control 
region from the auxiliary lac 03 operator through the lac Ol operator, it can be 
more tightly regulated than the other available expression vectors. In direct 
comparison studies with pKR223-3. pKK233-2. pTrc^A. and pF.T-2H+). the 
10 lowest level of expression under repressed conditions was achievable with the 
pLACl 1 expression vector. Under fully induced conditions, pLACl 1 expressed 
lacZ protein that was comparable to the levels achievable with the other 
expression vectors. Induction ratios oflOOOx have been observed with the wild- 
type lac operon. Of all the expression vectors that were tested, only pLACl 1 
1 5 yielded induction ratios which were comparable to what has been observed with 
the wild-type lac operon. It should be noted that the regulation achievable by 
pLACl 1 may be even better than the data in Table 6 indicates. Because lacZ 
was used in this test, the auxiliary lac 02 operator which resides in the coding 
region of the lacZ gene was provided to the pKK223-3, pKK233-2, pTrc99A, 
20 and pET-21(+) vectors which do not normally contain either the 02 or 03 

auxiliary operators. Thus the repressed states that were observed in the study in 
Table 6 are probably lower than one would normally observe with the pKK223- 
3, pKK233-2, pTrc99A, and pET-21( + ) vectors. 

To meet the expression needs required under different experimental 
25 circumstances, two additional expression vectors which are derivatives of 

pLACl 1 were designed. pLAC22 provides lacH on the vector and thus unlike 
pLACl 1 does not require an extraneous source of Lac I for its repression. 
pLAC33 contains the mutated Col hi rephcon from pUC8 and thus allows 
proteins to be expressed at much higher levels due to the increase in the copy 
30 number of the vector. Of all the expressions that were evaluated in direct 

comparison studies, the highest level of protein expression under fully induced 
conditions was achieved using the pLAC33 vector. Because the cloning regions 
are identical in pLACIL pLAC22. a:v;i pLAC33. genes that z\-: cloned into one 
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of these vectors can be trivially subcloned into either of the other two vectors 
depending on experimental circumstances. For physiological studies. pLACl 1 
is the best suited of the three vectors. If however, the bacterial strain of choice 
can not be modified to introduce elevated levels of Lac repressor protein which 
5 can be achieved by F's or compatible plasmids that provide lacH or lacWK the 
pLAC22 vector can be utilized. If maximal overexpression of a gene product is 
the goaf then the pLAC33 vector can be utilized. 

Numerous experiments call for expression of a cloned gene product at 
physiological levels; i.e., at expression levels that are equivalent to the 
10 expression levels observed for the chromosomal copy of the gene. While this is 
not easily achievable with any of the commonly utilized expression vectors, 
these kinds of experiments can be done with the pLACl 1 expression vector. By 
varying the IPTG concentrations, expression from the pLACl 1 vector can be 
adjusted to match the expression levels that occur under different physiological 
1 5 conditions for the chromosomal copy of the gene. In tact, strains which contain 
both a chromosomal null mutation of the gene in question and a pLACl 1 
construct of the gene preserve the physiological phenotype of the null mutation 
under repressed conditions. 

Because the use of Lac repressor is an essential component of any 
20 expression vector that utilizes the lac operon for its regulation, the ability of 
different source of Lac I to repress the pLACl 1 vector was also investigated. 
Researchers have historically utilized either lacH constructs which make 10 fold 
more Lac repressor than wild-type lad or IcicI°l ] constructs which make 100 fold 
more Lac repressor than wild-type lad (B. Muller-Hill. Prog. Biophys. Mol. 
25 Biol. 30:227-252 (1975)). The greatest level of repression of pLACl 1 constructs 
could be achieved using F's which provided approximately one copy of the 
/ucM 1 eene or a multicopy compatible plasmid which provided approximately 
six copies of the lacM gene. However, the induction that was achievable using 
these lad sources was significantly lower than what could be achieved when F's 
30 which provided approximately one copy of the lacB gene were used to repress 
th-j pLACl 1 construct. Thus if physiological studies are the goal oi an 
investigation, then F's which provide approximately one cony of the lad° ] gene 
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or a multicopy compatible plasmid which provides approximately six copies oi 
the lucH gene can be used to regulate the pLACl 1 vector. However, if maximal 
expression is desired, then F's w hich provide approximately one copy of the 
lacH gene can be utilized. Alternatively, if a bacterial strain can tolerate 
5 prolonged overexpression of an expressed gene and ovcrexpression of a gene 
product is the desired goal, then maximal expression under induced conditions is 
obtained when a bacteria strain lacks any source of Lac repressor. 

Example II 

10 

An in vivo approach for generating novel bioactive peptides that inhibit the 

growth of E, coli 

A randomized oligonucleotide library containing sequences capable of 

15 encoding peptides containing up to 20 amino acids was cloned into pLACl 1 
(Example 1) which allowed the peptides to either be tightly turned off or 
overproduced in the cytoplasm of E. coli. The randomized library was prepared 
using a [KNN] codon design instead of either the [NN(G.T)] or [NN(G,Q] 
codon design used by most fusion-phage technology researchers. [NN(G,T)] or 

20 [NN(G.C)] codons have been widely used instead of [NNN] codons to eliminate 
two out of the three stop codons, thus increasing the amount of full-length 
peptides that can be synthesized without a stop codon (J. Scott et ah, Science 
249:386-390 (1990); J. Delvin et ah. Science 249:404-406 (1990); S. Cwirla et 
ah, Proc. Naff Acad. Sci. U.S.A. 87:6378-6382 (1990)). However, the 

25 fNN(G.T)] and [KN(G.C)] oligonucleotide codon schemes eliminate half of the 
otherwise available codons and. as a direct result, biases the distribution of 
amino acids that are generated. Moreover, the (NN(G.T)] and (NN(G.C)] codon 
schemes drastically affect the preferential codon usage of highly expressed genes 
and removes a number of the codons which are utilized by the abundant tRNAs 

30 that are present in E. coli (H. Grosjean et aL Gene. 18: 199-209 (1982); T. 
Ikemurr, j. Mol. Biol. 151: 389-409 (1981)). 

Of the 20.000 peptides screened in this Example, 21 inhibitors of cell 
growth were found which could prevent the growth of E coli on minimal media. 
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The top twenty inhibitor peptides were evaluated for strength of inhibition, and 
the putative amino acid sequences of the top 10 "anchorless" inhibitor peptides 
were examined for commonly shared features or motils. 

5 MATERIALS AND METHODS 

Media. Rich LB and minimal M9 media used in this study was prepared as in 
Example I. Ampicillin was used in rich media at a final concentration of 100 
ug/ml and in minimal media at a final concentration of 50 ug/ml. IPTG was 
1 0 added to media at a final concentration of 1 mM. 

Chemicals and Reagents. Extension reactions were carried out using Klenow 
from New England Biolabs while ligation reactions were performed using T4 
DNA Ligase from Life Sciences. IPTG was obtained from Diagnostic 
1 5 Chemicals Limited. 

Bacterial Strains and Plasmids. ALS225, which is MC\06\ fY l lad^Z+Y+A+ 
(see Example I), was the E. coll bacterial strain used in this Example. The 
genotype for MC1061 is araD139 A(araABOIC-leu)7679 A(lac)X74 gal U gal K 
20 rpsL hsr- hsm+ (M. Casadaban et ah, J. Mol. Biol. 138:179-207 (1980)). 
pLACl 1. a highly regulable expression vector, is described in Example I. 

Generation of the Randomized Peptide Library. The 93 base oligonucleotide 
5' 1 AC TAT AGA TCT ATG (NKN)20 TAA TAA GAA TTC TCG ACA 3" 

25 (SEQ ID NO: 23), where N denotes an equimolar mixture of the nucleotides A. 
C. G. or T, was synthesized with the trityl group and subsequently purified with 
an OPC cartridge using standard procedures. The complementary strand of the 
93 base oligonucleotide was generated by an extensioaTill-in reaction with 
Klenow using an equimolar amount of the 18 base oligonucleotide primer 5' 

30 TGT CGA GAA TTC TTA TTA 3' (SEQ ID NO: 24). After extension, the 

resulting ds-DNA was purified using a Promega DNA clean-up kit and restricted 
with EcoR 1 and Bgl II (Promega. Madison, WI). The digested DNA was again 
purified using a Promega DNA clean-up kit and liga'e.i to pEACl 1 vector which 
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had been digested with the same two restriction enzymes. The resulting library 
was transformed into electrocompetent ALS225 E. coll cells under repressed 
conditions (LB. ampicillin, plus glucose added to 0.2H 0 ). 

Screening of Transformants to Identify Inhibitor Clones Transfbrmants 

5 were screened to identity any that could not grow on minimal media when the 
peptides were overproduced. Using this scheme, any transibrmant bacterial 
colony that overproduces a peptide that inhibits the production or function of a 
protein necessary for growth of that transfonnant on minimal media will be 
identified. Screening on minimal media, which imposes more stringent growth 

10 demands on the cell, will facilitate the isolation of potential inhibitors from the 
library. It is well known that growth in minimal media puts more demands on a 
bacterial cell than growth in rich media as evidenced by the drastically reduced 
growth rate; thus a peptide that adversely affects cell growth is more likely to be 
detected on minimal media. Screening was carried out using a grid-patching 

1 5 technique. Fifty clones at a time were patched onto both a rich repressing plate 
(LB Amp glucose) and a minimal inducing plate (M9 glycerol Amp IPTG) using 
an ordered grid. Patches that do not grow are sought because presumably these 
represent bacteria that are being inhibited by the expressed bioactive peptide. To 
verify that all of the inhibitors were legitimate, plasmid DNA was made from 

20 each inhibitory clone (QIA Prep Spin Miniprep kit; Qiagen Cat. No. 27104) and 
transformed into a fresh background ( ALS225 cells), then checked to confirm 
that they were still inhibitory on plates and that their inhibition was dependent 
on the presence of the inducer, IPTG. 

25 Growth Rate Analysis in Liquid Media. Inhibition strength of the peptides 
was assessed by subjecting the inhibitory clones to a growth rate analysis in 
liquid media. To determine the growth rate inhibition, starting cultures of both 
the peptides to be tested and a control strain which contains pLACl 1 were 
diluted from a saturated overnight culture to an initial OD550 of -.01. All 

30 cultures were then induced with 1 mM IPTG and OD.w- readings were taken 
until the control culture reached an OD 55( , of -0.5. The hypothetical data in 
'fable 9 shows that when the control strain reaches an OD : - of about 0.64 (at 
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about 15 hours), a strain which contains a peptide that inhibits the growth rate at 
50% will only have reached an OD 5?l , of only about 0.08. Thus, the growth of a 
50% inhibited culture at 15 hours (i.e.. the OD 5 $o at hours, which is 
proportional to the number of cells in a given volume of culture) is only about 
5 1 2.5% (that is. 0.08/0.64 x 100) of that of a control strain after the same amount 
of time, and the inhibitor peptide would thus have effectively inhibited the 
growth of the culture (as measured by the OD550 at the endpoint) by 87.5% 
(=100% - 12.5%). 

10 Table 9: Hypothetical data from a peptide that inhibits growth rate at 
30%, 50% and 70% 



Time in hours 


OD550 readings on a 
control culture which 
contains pLAC 1 1 


OD550 readings on a culture \vh 
peptide that inhibits the growth r 


ch contains a 
ate at... 






25% 


50% 


75% 


0 


.010 


.010 


.010 


.010 


2.5 


.020 


.017 


.015 


.012 


5 


.040 


.028 


.020 


.014 


7.5 


.080 


.047 


.030 


.017 


10 


.160 


.079 


.040 


.020 


12.5 


.320 


.133 


.060 


.024 


15 


.640 


.226 


.080 


.028 



1 5 An example is shown in Fig. 6, wherein ALS225 cells containing the 

pLACl 1 vector (control), and either the one day inhibitor pPepl or the two day 
inhibitor pPepl2 (see below), were grown in minimal M9 glycerol media with 
IPTG added to 1 mM. OD550 readings were then taken hourly until the cultures 
had passed log phase. Growth rates were determined by measuring the 

20 spectrophotometry change in OD55O unit time within the log phase of 

growth. The inhibition of the growth rate was then calculated for the inhibitors 
using pl.ACI 1 as a control. 

Sequencing the Coding Regions of the Inhibitor Peptide Clones. The 

25 forward primer 5' TCA TTA ATG CAG CTG GCA CG .V (SEQ ID NO: 25) and 
the reverse primer 5* TTC ATA CAC GOT GCC TGA CT 3' (SFQ ID NO: 26) 
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were used to sequence both strands of the top ten "anchorless" inhibitor peptide 
clones identified by the grid-patching technique. If an error- free consensus 
sequence could not be deduced from these two sequencing runs, both strands of 
the inhibitor peptide clones in question were resequenced using the forward 
5 primer 5' TAG CTC ACT CAT TAG GCA CC 3' (SEQ ID NO: 27) and the 
reverse primer 5' GAT GAC GAT GAG CGC ATT GT 3* (SEQ ID NO: 28). 
The second set of primers were designed to anneal downstream of the first set of 
primers in the pLACl 1 vector. 



1 0 Generating Antisense Derivatives of the Top Five "Anchorless" Inhibitor 
Clones. Oligonucleotides were synthesized which duplicated the DNA insert 
contained between the Bgl II and EcoR I restriction sites for the top five 
"anchorless" inhibitor peptides as shown in Table 12 with one major nucleotide 
change. The "T M of the ATG start codon was changed to a "C" which resulted in 

1 5 a ACG which can not be used as a start codon. The oligonucleotides were 

extended using the same 18 base oligonucleotide primer that was used to build 
the original peptide library. The resulting ds-DNA was then restricted, and 
cloned into pLACl 1 exactly as described in the preceding section "Generating 
the randomized peptide library". The antisense oligonucleotides that were used 

20 are as follows: 

pPepl (antisense): 5' TAC TAT AG A TCT ACG GTC ACT GAA TTT TGT 
GGC TTG TTG GAC CAA CTG CCT TAG TAA TAG TGG AAG GCT GAA 
ATT AAT AAG AAT TCT CGA CA 3' (SEQ ID NO: 29): 

25 

pPep5(antisense): 5' TAC TAT AGA TCT ACG TGG CGG GAC TCA TGG 
ATT AAG GGT AGG GAC GTG GGG TTT ATG GGT TAA AAT AGT TTG 
ATA ATA AGA ATT CTC GAC A 3' (SEQ ID NO: 30) 

30 pPepl2(antisense): 5' TAC TAT AGA TCT ACG AAC GGC CGA ACC AAA 
CGA ATC CGG GAC CCA CCA GCC GCC TAA AC A GCT ACC AGC TGT 
GGT AAT AAG AAT TCT CGA CA 3' (SEQ ID NO: 31) 
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pPe P 13(antisense): 5' TAC TAT AGA TCT ACG GAC CGT GAA GTC, ATG 
TGT GCG GCA AAA CACi GAA TGG AAG GAA CGA ACG CCA TAG 
GCC GCG TAA TAA GAA TTC ICG ACA ?' < SLQ ID NO: 32) 

5 P Pepl9iantisense): 5' TAC TAT AGA TCT ACG AGG GGC GCC AAC TAA 
GGG GGG GGG AAG GTA TTT GTC CCG TGC ATA ATC TCG GGT GTT 
GTC TAA TAA GAA TTC TCG ACA 3 T (SF-Q ID NO: 33) 

RESULTS 

10 

Identifying and Characterizing Inhibitor Peptides from the Library. 

Approximately 20,000 potential candidates were screened as described 
hereinabove, and 21 IPTG-depcndent growth inhibitors were isolated. All the 
inhibitors so identified were able to prevent the growth of the E. coli bacteria at 

15 24 hours, and three of the 21 inhibitors were able to prevent the growth of the E. 
coli bacteria at 48 hours, using the grid patching technique. These three 
inhibitors were classified as "two day" inhibitors; the other 18 were classified as 
"one-day" inhibitors. 

Results from the growth rate analysis for candidate peptide inhibitors are 

20 shown in Table 10. The % inhibition of the growth rate was calculated by 
comparing the growth rates of cells that contained induced peptides with the 
growth rate of cells that contained the induced pLACl 1 vector. Averaged values 
of three independent determinations are shown. 
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Table 10: Ability of the Inhibitor Peptides to Inhibit Cell Growth 



Inhibitor 


Type 


°/» Inhibition 


Inhibitor 


Type 


% Inhibition 


I A Z" 1 1 1 

pLAC 1 1 
(control) 




o 


Ppep 1 i 


1 Day 




pPepl 


1 Hiv 
I 1 \ 


25 


Ppepl2 


2 Day 


82 


nPen2 


1 Day 


23 


Ppepl 3 


1 Day 


28 


P Pep3 


2 Day 


80 


Ppep 14 


2 Day 


71 


pPep4 


1 Day 


21 


Ppepl 5 


1 Day 


23 


pPepS 


1 Day 


24 


Ppcpl6 


1 Day 


24 


pPep6 


1 Day 


27 


Ppepl 7 


1 Day 


28 


pPep7 


1 Day 


26 


PPepl 8 


1 Day 


24 


pPepS 


1 Day 


29 


pPepl9 


1 Day 


29 


pPep9 


1 Day 


22 


pPep20 


1 Day 


19 


pPeplO 


1 Day 


24 


pPep21 


1 Day 


23 



Of the 21 peptides that were tested, the one-day inhibitor peptides 
5 inhibited the bacterial growth rate at a level of approximately 25%, while the 
two-day inhibitor peptides inhibited the bacterial growth rate at levels greater 
than 75%. As can be seen from the hypothetical data in Table 9, a one-day 
inhibitor which inhibited the growth rate at 25% would have only reached an 
OD550 of 0.226 when the control strain reached an OD550 of 0.64. At that point 
1 0 in time, the growth of the culture that is inhibited by a one-day inhibitor (as 
measured by the end-point OD550) will only be only 35.3% of that of a control 
strain at that point; thus the inhibitor peptide would have effectively inhibited the 
growth of the culture by 64.7%. A two-day inhibitor which inhibited the growth 
rate at 75% would have only reached an OD 55f . of 0.028 when the control strain 
1 5 reached an OD550 of 0.64 . Thus the growth of the culture that is being inhibited 
by a two-day inhibitor will only be 4.4% of that of the control strain at this point, 
and the inhibitor peptide w ould have effectively inhibited the growth of the 
culture by 95.6%. These calculations are consistent with the observation that 
tw o-day inhibitors prevent the gr ;..- ;n of bacteria on plates for a full 48 hours 
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while the one-day inhibitors only prevent the growth of bacteria on plates for 24 
hours. 

All 21 candidates were examined using restriction analysis to determine 
whether they contained 66 bp inserts as expected. While most of them did. the 

5 two day inhibitors pPep3 and pPepM were found to contain a huge deletion. 
Sequence analysis of these clones revealed that the deletion had caused the 
carboxy-terniinal end of the inhibitor peptides to become fused to the amino- 
terminal end of the short 63 amino acid Rop protein. The rop gene which is part 
of the ColEl replicon is located downstream from w here the oligonucleotide 

10 library is inserted into the pLACl 1 vector. 



Sequence Analysis of the Top 10 "Anchorless" Inhibitor Peptides. The DNA 

fragments comprising the sequences encoding the top 1 0 "anchorless'' inhibitor 
peptides (i.e., excluding the two Rop fusion peptides) were sequenced, and their 

1 5 coding regions are shown in Table 1 1 . Stop codons are represented by stars, and 
the landmark Bgl II and FxoR I restriction sites for the insert region are 
underlined. Since the ends of the oligonucleotide from which these inhibitors 
were constructed contained these restriction sites, the oligonucleotide was not 
gel isolated when the libraries were prepared in order to maximize the 

20 oligonucleotide yields. Because of this, several of the inhibitory clones were 

found to contain one (n-1) or two (n-2) base deletions in the randomized portion 
of the oligonucleotide. 
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Tabic 11: Sequence analysis of the insert region from the top 10 

inhibitory clones and the peptides that they are predicted to 
encode 



5 ppepl - 13 aa 

GAG . AA AGA T_T AT 3 GTG A A" GAA TTT 7GT g ' TT : 1 A. V- AA A "1 TAA T A ."> T _A AAA G3 

m V T r. ' - - L L i 0 I F • ' * IP N 

GAA. ATT AAT AAG AAT T£ (GEO I ^ ■■ ■ 

10 

pi Pep 5 - ic a a 

:a - ; 1Vv AAA TAT ATG TGG CGG GAG T AA TGG ATT AAG G3T AG- GA T GTG GGG TTT AT G GGT TAA AAT AG 
y W F- - S W I K A F A V G I * - * (SEG I 

15 N;: >f- 

TTG ATA ATA AGA AT_T G , G EC U< N^: 3" 



ppep6 - 42 aa - last 25 aa could form a hydrophobic membrane- 
20 spanning domain 

GAG GAA AGA J.A ATG TGA GGG GGA GAT GTG AAG AGG GAG TGG AAG TGG GGG ATG TGG AAT CGT TGG ATG 

r . S G G H V T F E 3 K G A M G N F. W T 

TAT GTA ATA AGA ATT CTC ATG TTT GAG AGG TTA TGA TGG ATA AGG TT" AAT GGG GTA GTT TAT CAC AGT 

v V I R : L M r D S L S f 1 S F N A V V Y H S 

25 TAA (GEO ID NO: 3S) 
(SEQ in NO: 39) 



30p?ep/-6aa 

GAo GAA AGA J_GT ATG TAT TTG TTT ATG GGA TAA TAG TTA ATG GTG GGG TGG AGA A^T TGA GTT T 

y L F I G * (SFO IT- 1 NO: AO) 
■GAA TT: ( 5 EC IT> NO: 41) 

35 

pPepB - 21 aa 

GA G/vA AGA TGT ATG GTT CTA TTT GGG GGG GAG TGG GGG GAG AAA GG 

y, l :. r o c- r c g g f a 

TGA A" 3 TAA TAA GAA TTG {SEG I [- N G : 42) 
40 G F- II' NO: 4 A 



:: GGA TAG TTT AGT GTG GTA CG>; 

i F T V I. V 



pPeoIC - 2C aa - predicted tc be 4 5' p-sheet -aroino acids 6-1-5 

CA'J A^ AGA TGT ATG ATT GGG GGA TGG TTG A "AG TTG GG 3 TGG GGA ATA GTT T GT AAT AAG AAA T~7 

45 y Z G G 3 L G t A W A ! Y C N r N G I 
GTT TGA :GEC IC NO: 44, 
t GE; IG Nj: 4! J 



50 
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tgi 



pPepI3 -18 aa - predicted to bo- a-r.el:cjl - anno acid? 

3-13 



10 y. : l Z V M C A , K F. r' £ F T F 

G^C GZZ TAA TAA GAA TTC ;3CQ I [' K . : : 4^; 



pPepl7 - 12 a& 



^ s b: o ii' no 



15 



20 

ppepl9 - 3 aa 

CAG GAA AGA TCT ATG AGG GGC GCC AAC TAA GGG GGG GGG AAG GTA TTT GTC CCG TGC ATA ATC TCG GGT 

K R G A N * (SLG Hi N' V 51 i 
GTT GTC TAA. TAA GAA TTC jSBQ II* NG: 53) 

25 

Eight out of the top 10 inhibitors were predicted to encode peptides 
which terminate before the double TAA TAA termination site which was 
engineered into the oligonucleotide. Two of the inhibitors, pPep6 and pPeplO, 
which contain deletions within the randomized portion of the oligonucleotide, 

30 are terminated beyond the EcoR I site. One of the inhibitors, pPepl 7, contains a 
termination signal just after the ATG start codon. However, just downstream 
from this is a Shine Dalgarno site and a GTG codon which should function as 
the start codon. Interestingly, the start sites of several proteins such as Rop are 
identical to that proposed for the pPepl7 peptide (G. Cesareni et al., Proc. Natl. 

35 Acad. Sci. USA. 79:6313-6317 (1982)), The average and median length for the 
8 peptides whose termination signals occurred before or at the double TAA TAA 
termination site was 13 amino acids. 

The characteristics of the predicted coding regions of the inhibitor 
peptides proved to be quite interesting. Three out of the top 1 0 peptides, pPepl. 

40 pPepl3. and pPepl7. contained a proline residue as their last (C-tcrminal) amino 
acid. Additionally, one of the peptides. pPepl2. contained 2 proline residues 
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near the C-terminus, at the n-2 and no positions. Thus there appears to be a bias 
for the placement of proline residues at or near the end of several of the 
inhibitor) peptides. Secondary structure analysis predicted that 3 out of the 10 
peptides contained a known motif that could potentially form a very stable 
5 structure. pPepl3, a peptide containing a C-terminal proline, is predicted to be 
72°/o a-helicaL pPeplO is predicted to be 45% p-sheet. and pPep6 is predicted to 
form a hydrophobic membrane spanning domain. 



Verifying that the Inhibitor} Clones do not Function as Antisensc. To rule 
10 out the possibility that the bioactivity of the inhibitory clones resulted from their 
functioning as antisense RNA or DMA (thus hybridizing to host DNA or RNA) 
rather than by way of the encoded peptides, the insert regions between the Bgl II 
and EcoR I sites for the top five inhibitors from Table 10 were recloned into the 
pLACl 1 vector using oligonucleotides which converted the ATG start codon to 
1 5 an ACG codon thus abolishing the start site . In all five cases the new constructs 
were no longer inhibitory (see Table 12), thus confirming that it is the encoded 
peptides that causes the inhibition and not the DNA or transcribed mRNA. 
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Table 12: Antisensc test of the top 5 "anchorless^ inhibitory peptides from 



Inhibitor) peptide 


0 o inhibition versus 
pLAC I 1 control 


Ant i sense construct 


% inhibition 
versus pLACl 1 
control 


pPepl 


26 


pPepl-anti 


0 


pPepS 


23 


pPep5-anti 


0 


pPepl2 


80 


pPepl2-anti 


0 


pPepl3 


28 


pPepl3-anti 


0 


pPeplO 


29 


pPep 1 9-anti 


0 



5 Growth rates for cells containing the induced inhibitors or antisense constructs 
were determined and then the % inhibition was calculated by comparing these 
values to the growth rate of cells that contained the induced pCyt-3 vector. 

DISCUSSION 

10 

Use of the tightly regulable pLACl 1 expression vector made possible the 
identification of novel bioactive peptides. The bioactive peptides identified 
using the system described in this Example inhibit the growth of the host 
organism (£. coli) on minimal media. Moreover, bioactive peptides thus 

1 5 identified are, by reason of the selection process itself stable in the host's 

cellular environment. Peptides that are unstable in the host cell, whether or not 
bioactive, will be degraded: those that have short half-lives are. as a result, not 
part of the selectable pool. The selection system thus makes it possible to 
identify and characterize novel, stable, degradation-resistant bioactive peptides 

20 in essentially a single experiment. 

The stability of the inhibitory' peptides identified in this Example may be 
related to the presence of certain shared structural features. For example, three 
out of the top 10 inhibitory "anchorless" (i.e.. non-Rop fusion) peptides 
contained a proline residue as their last amino acid. According to the genetic 

25 code, a randomly generated oligonucleotide such as the one used in this Example 
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has only a 6% chance of encoding a proline at a given position, yet the frequency 
of a C-terminal proline among the top ten inhibitory peptides is a full 30%. This 
5-fold bias in favor of a C-terminal proline is quite surprising, because although 
the presence of proline in a polypeptide chain generally protects biologically 

5 active proteins against nonspecific enzymatic degradation, a group of enzymes 
exists that specifically recognize proline at or near the N- and C-termini of 
peptide substrates. Indeed, proline-specific peptidases have been discovered that 
cover practically all situations where a proline residue might occur in a potential 
substrate (D.F. Cunningham et ah, Biochimica et Biophysica Acta 1343:160-186 

10 (1 997)). For example, although the N-terminal sequences Xaa-Pro-Yaa- and 
Xaa-Pro-Pro-Yaa (SEQ ID NO: 54) have been identified as being protective 
against nonspecific N-terminal degradation, the former sequence is cleaved by 
aminopeptidase P (at the Xaa-Pro bond) and dipeptidyl peptidases IV and II (at 
the -Pro-Yaa-bond) ) (Table 5, G. Vanhoof et ah, FASEB J. 9:736-44 (1995); 

15 D.F. Cunningham et ah, Biochimica et Biophysica Acta 1343:160-186 (1997)); 
and the latter sequence, present in bradykinin, interleukin 6, factor XII and 
erythropoietin, is possibly cleaved by consecutive action of aminopeptidase P 
and dipeptidyl peptidase IV (DPPIV), or by prolyl oligopeptidase (post Pro-Pro 
bond) (Table 5, G. Vanhoof et ah, FASEB J. 9:736-44 (1995)). Prolyl 

20 oligopeptidase is also known to cleave Pro-Xaa bonds in peptides that contain an 
N-terminal acyl-Yaa-Pro-Xaa sequence (D.F. Cunningham et ah, Biochimica et 
Biophysica Acta 1343:160-186 (1997)). Other proline specific peptidases acting 
on the N-terminus of substrates include prolidase, proline iminopeptidase and 
prolinase. Prolyl carboxypeptidase and carboxypeptidase P, on the other hand, 

25 cleave C-terminal residues from peptides with proline being the preferred Pi 

residue (D.F. Cunningham et ah, Biochimica et Biophysica Acta 1343:160-186 
(1997). 

Also of interest with respect to the stability of the inhibitory peptides, 
three of the top ten (30%) contained motifs that were predicted, using standard 
30 protein structure prediction algorithms, to form stable secondary structures. One 
of the peptides (which also has a C-terminal proline) was predicted to be 72% a- 
helical. Another was predicted to be 45% p-sheet: this peptide may dimerize in 
order to ef'V *;t the hydror_t;n bonding necessary to form the p-sheet. A third was 



WO 00/22112 




PCT/US99/23731 



61 



predicted to possess a hydrophobic membrane spanning domain. According to 
these algorithms (see. e.g., P. Chou et al.. Adv. Enzymol. 47:45-148 (1978); J. 
Gamier et ah, J. Mol. Biol. 120:97-120 ( 1978); P. Chou. "Prediction of protein 
structural classes from amino acid composition." In Prediction of Protein 
5 Structure and the Principles of Protein Conformation (Fasman. G.D. ed.). 
Plenum Press, New York, N.Y. 549-586 ( 1 990); P. Klein et al.. Biochim. 
Biophys. Acta 815:468-476 (1985)), a randomly generated oligonucleotide such 
as the one used in our studies would have had no better than a 1 in a 1000 chance 
of generating the motifs that occurred in these peptides. 

10 Finally, two of the three two-day inhibitors proved to be fusion peptides 

in which the carboxyl terminus of the peptides was fused to the amino terminus 
of the Rop protein. Rop is a small 63 amino acid protein that consists of two 
antiparallel a-helices connected by a sharp hairpin loop. It is a dispensable part 
of the ColEl replicon which is used by plasmids such as pBr322, and it can be 

1 5 deleted without causing any ill-effects on the replication, partitioning, or copy 
numbers of plasmids that contain a ColEl ori (X. Soberon, Gene. 9: 287-305 
(1980). Rop is known to possess a highly stable structure (W. Eberle et al., 
Biochem. 29:7402-7407 (1990); S. Betz et ah, Biochemistry 36:2450-2458 
(1997 )), and thus it could be serving as a stable protein anchor for these two 

20 peptides. 

Table 13 lists naturally occurring bioactivc peptides whose structures 
have been determined. Most of these peptides contain ordered structures, further 
highlighting the importance of structural stabilization. Research on developing 
novel synthetic inhibitor} 7 peptides for use as potential therapeutic agents over 

25 the last few years has shown that peptide stability is a major problem that must 
be solved if designer synthetic peptides are to become a mainstay in the 
pharmaceutical industry (J. Bai et al.. Crit. Rev. 1 her. Drug. 12:339-371 (1995): 
R. Egleton Peptides. 18:1431-1439 (1997); L. Wearlcy, Crit Rev Ther Drug 
Carrier Syst. 8: 331-394 (1991). The system described in this Example 

30 represents a major advance in the art of peptide drug development by biasing the 
selection process in favor of bioactive peptides that exhibit a high degree of 
stability in an intracellular environment. 
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Table 13: Structural motifs observed in naturally occurring bioactive 
peptides 



Bioactive 


Size in 


Structural 


Reference 


Peptide 


Amino acids 


Motif 




Dermaseptin 


34 


u-helix 


34 


Endorphin 


30 


a-helix 


n 


Glucagon 


29 


a-helix 


6 


Magainins a 


23 


a- helix 


S 


Mastoparan 


14 


a-helix 


1 1 


Melittin 


26 


a- helix 


44 


Motiiin 




a-helix 


25 


PK1 (5-24) 


20 


a-helix 


38 


Secretin 


27 


a-helix 


8 


Atrial Natriuretic Peptide 


28 


disulfide bonds 


33 


Calcitonin 


32 


disulfide bonds 


4 


Conotoxins a 


10-30 


disulfide bonds 


37 


Defensins a 


29-34 


disulfide bonds 


30 


EETI II 


29 


disulfide bonds 


23 


Oxytocin 


9 


disulfide bonds 


45 


Somatostatin 


14 


disulfide bonds 


35 


Vasopressin 


9 


disulfide bonds 


20 


Bombesin 


14 


disordered 


12 


Hi stat in 


24 


disordered 


51 


Substance P 


1 1 


disordered 


50 



a These peptides belong to multi-member families. 
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Example III 

Directed synthesis of stable synthetically engineered inhibitor peptides 

5 These experiments were directed toward increasing the number of 

bioactive peptides produced by the selection method described in Example II. In 
the initial experiment, randomized peptides fused to the Rop protein, at either the 
N- or C- terminus, were evaluated. In the second experiment, nucleic acid 
sequences encoding peptides containing a randomized internal amino acid 
10 sequence flanked by terminal prolines were evaluated. Other experiments 
included engineering into the peptides an a-helical structural motif, and 
engineering in a cluster of opposite charges at the N- and C-termini of the 
peptide. 

15 

MATERIALS AND METHODS 

Media. Rich LB and minimal M9 media used in this study was prepared as 
described by Miller (see Example I). Ampicillin was used in rich media at a 
20 final concentration of 100 ug/ml and in minimal media at a final concentration of 
50 ug/ml. IPTG was added to media at a final concentration of 1 mM. 

Chemicals and Reagents. Extension reactions were carried out using Klenow 
from New England Biolabs (Bedford. MA) while ligation reactions were 
25 performed using T4 DKA ligase from Life Sciences (Gaithersburg, MD) 

Alkaline phosphatase (calf intestinal mucosa) from Pharmacia (Piscataway, NJ) 
was used for dephosphorylation. IPTG was obtained from Diagnostic Chemicals 
Limited (Oxford, CT). 

30 Bacterial Strains and Plasmids. ALS225, which is MC\06\ fF'Iacfl ] Z+Y+A+i 
was the E. coli bacterial strain used in this study (see Example I). The genotype 
for MC 1 06 1 is araD139 MaraABOIC-lew 76~9 A(1ac)X~4 gal U gal K rpsL hsr- 
hsm+ previously described. pLACl 1 (Example I), a highly regulable 
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expression vector, was used to make p-Rop(C) and p(N)Rop- fusion vectors as 
well as the other randomized peptide libraries which are described below. 



Construction of the p-Rop(C) Fusion Vector. The forward primer 5* TAC 
5 TAT AGA TCT ATG ACC AAA CAG GAA AAA ACC GCC 3' (SEQ ID NO: 
55) and the reverse primer 5' TAT ACG TAT TCA GTT GCT CAC ATG TTC 
TTT CCT GCG 3' (SEQ ID NO: 56) were used to PGR amplify a 558 bp DNA 
fragment using pBR322 as a template. This fragment contained a Bgl II 
restriction site which was incorporated into the forward primer followed by an 
10 ATG start codon and the Rop coding region. The fragment extended beyond the 
Rop stop codon through the Afl III restriction site in pBR322. The amplified 
dsDNA was gel isolated, restricted with Bgl II and All III, and then ligated into 
the pLACl 1 expression vector which had been digested with the same two 
restriction enzymes. The resulting p-Rop(C) fusion vector is 2623 bp in size 
15 (Eig. 7). 

Construction of the p(N)Rop- Fusion Vector. The forward primer 5' AAT 
TCA TAC TAT AGA TCT ATG ACC AAA CAG GAA AAA ACC GC 3' 
(SEQ ID NO: 57) and the reverse primer 5* TAT ATA ATA CAT GTC AGA 

20 ATT CGA GGT TTT CAC CGT CAT CAC 3* (SEQ ID NO: 58) were used to 
PCR amplify a 201 bp DNA fragment using pBR322 as a template. This 
fragment contained a Bgl II restriction site which was incorporated into the 
forward primer followed by an ATG start codon and the Rop coding region. The 
reverse primer placed an EcoR I restriction site just before the Rop TGA stop 

25 codon and an Afl III restriction site immediately after the Rop TGA stop codon. 
The amplified dsDNA was gel isolated, restricted with Bgl II and Afl III, and 
then ligated into the pLACl 1 expression vector which had been digested with 
the same two restriction enzymes. The resulting p(N)Rop- fusion vector is 2262 
bp in size (Tig. 8). 

30 

Generation of Rop Fusion Randomized Peptide Libraries. Peptide 
libraries were constructed as described in Example II. The synthetic 
oligonucleotide 5' TAC TAT AGA TCT ATG (NNN)20 CAT AGA TCT GCG 
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TGC TGT GAT 3' (SEQ ID NO: 59) was used to construct the randomized 
peptide libraries for use with the p-Rop(C) fusion vector, substantially as 
described in Example II. The complementary strand of this oligonucleotide was 
generated by a fill-in reaction with Klenow using an equimolar amount of the 
5 oligonucleotide primer 5' ATC ACA GCA CGC AGA TCT ATG 3 1 were used 
(SEQ ID NO: 60). After extension the resulting dsDNA was digested with Bgl 
II and ligated into the pLACl 1 expression vector which had been digested with 
the same restriction enzyme and subsequently dephosphorylated using alkaline 
phosphatase. Because of the way the oligonucleotide library has been 

10 engineered, either orientation of the incoming digested double-stranded DNA 
fragment results in a fusion product. 

To construct the randomized peptide libraries for use with the p(N)Rop- 
fusion vector, the randomized oligonucleotide 5' TAC TAT GAA TTC (NNN)20 
GAA TTC TGC CAC CAC TAC TAT 3' (SEQ ID NO: 61), and the primer 5' 

1 5 ATA GTA GTG GTG GCA GAA TTC 3' (SEQ ID NO: 62) were used. After 
extension the resulting dsDNA was digested with EcoRI and ligated into the 
pLACl 1 expression vector which had been digested with the same restriction 
enzyme and subsequently dephosphorylated using alkaline phosphatase. Because 
of the way the oligonucleotide library has been engineered, either orientation of 

20 the incoming digested double-stranded DNA fragment results in a fusion 
product. 

Generation of a Randomized Peptide Li bran' Containing Terminal 
Prolines. Randomized 20 amino acid peptide libraries containing two proline 

25 residues at both the amino and the carboxy terminal ends of the peptides were 
constructed using the synthetic oligonucleotide 5' TAC TAT AGA TCT ATG 
CCG CCG (NNN)16 CCG CCG TAA TAA GAA TTC GTA CAT 3' (SEQ ID 
NO: 63 ). The complementary strand of the 93 base randomized oligonucleotide 
was generated by filling-in with Klenow using the oligonucleotide primer 5' 

30 ATG TAC GAA TTC TTA TTA CGG CGG 3' (SEQ ID NO: 64). After 

extension the resulting dsDNA was digested with Bgl II and EcoR I and ligated 
into the pEACl 1 expression vector which had been digested with the same two 
rc>triction enzymes. Because the initiating methionine of the peptides coded by 
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this library is followed by a proline residue, the initiating methionine will be 
removed (F. Sherman et al. Bioessays 3: 27-31 (1085)). Thus the peptide 
libraries encoded by this scheme are 20 amino acids in length. 

5 Generation of a Randomized Hydrophilic ex-Helical Peptide Library. Table 
14 shows the genetic code highlighted to indicate certain amino acid properties. 



Table 14: Genetic Code Highlighted to Indicate Amino Acid Properties 



TIT 


phe 


ha 


TCT 


ser 




TAT 


tyr 


b a 


TGT 


cys 




TTC 


phe 


ha 


TCC 


ser 




TAG 


tyr 


h a 


TGC 


cys 




71 A 


leu 


H a 


TCA 


ser 




TAA 


OCH 




TGA 


OPA 




TTG 


leu 


H a 


TCG 


ser 




TAG 


AMB 




TGG 


tip 




CTT 


leu 


Ha 


CCT 


pro 


B a 


CAT 


his 


ha 


CGT 


arg 




CTC 


leu 


Ha 


CCC 


pro 


B a 


CAC 


his 


h a 


CGC 


arg 




CTA 


leu 


H a 


CCA 


pro 


B a 


CAA 


gin 


ha 


CGA 


arg 




CTG 


leu 


H a 


CCG 


pro 


B a 


CAG 


gin 


ha 


CGG 


arg 




ATT 


ile 


ha 


ACT 


thr 




AAT 


asn 


ba 


AGT 


ser 




ATC 


ile 


h a 


ACC 


thr 




AAC 


asn 


ba 


AGC 


ser 




ATA 


ile 


h a 


AC A 


thr 




AAA 


lys 


ha 


AGA 


arg 




ATG 


met 


H a 


ACG 


thr 




AAG 


lys 


h a 


AGG 


arg 




GTT 


val 


ha 


GCT 


ala 


H a 


GAT 


asp 


ha 


GGT 


gly 


B a 


GTC 


val 


ha 


GCC 


ala 


H a 


GAC 


asp 


ha 


GGC 


gly 


B a 


GTA 


val 


h a 


GCA 


ala 


Ha 


GAA 


glu 


Ha 


GGA 


gly 


B a 


GTG 


val 


ha 


GCG 


ala 


H a 


GAG 


glu 


Ha 


GGG 


gl> 


B a 



Boldface amino acids are hydrophobic while italicized amino acids are 
10 hydrophilic. The propensity for various amino acids to form a-helical structures is 
also indicated in this table using the conventions first described by Chou and 
Fasman (P. Chou et <x\..Adv. Enzymol. 47:45-148 (1978)). H a = strong a-helix 
former, h a = a-helix former, B a = strong cx-helix breaker, b a = a-helix breaker. The 
assignments given in this table are the consensus agreement from several different 
1 5 sources. Hydrophilic versus hydrophobic assignments for the amino acids were 
m-jcic from data found in Wolfenden et. al. (Biochemistry. 20:849-55 (19? I)): 
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Miller et. al. U Mol Biol. 196:641-656 (1987)): and Roscman (J. Mol Biol 
200:5 1 3-22( 1 988 ) ). The propensity for amino acids to form a-helical structures 
were obtained from consensus agreements of the Chou and Fasman ( P. Chou et ah. 
Adv. Enzvmol 47:45-148 (1978); I\ Chou, "Prediction of protein structural 
classes from amino acid compositions." in Prediction of protein structure and 
the principles of protein conformation (G. Fasman. G.D. ed.). Plenum Press, 
New York, N.Y. 549-586 (1990)); Gamier, Osguthorpe. and Robson U Mol 
Biol. 120:97-120 (1978)); and O'Neill and DeGrado (Science. 250:646-651 
(1990)) methods for predicting secondary structure. 

By analyzing the distribution pattern of single nucleotides in the genetic 
code relative to the properties of the amino acids encoded by each nucleotide 
triplet, a novel synthetic approach was identified that would yield randomized 18 
amino acid hydrophilic peptide libraries with a propensity to form a-hchces. 
1 5 According to Table 14, the use of a [ ( C AG ) A (TC A G ) ] codon mixture yields the 
hydrophilic amino acids His. Gin, Asm Lys, Asp, and Glu. These amino acids are 
most often associated with a-helical motifs except for asparagine. which is 
classified as a weak a-helical breaker. If this codon mixture was used to build an 
a-helical peptide, asparagine would be expected to occur in about 17% of the 
20 positions, which is acceptable in an a-helical structure according to the secondary' 
structure prediction rules of either Chou and Fasman (P. Chou et al.. Adv. 
Enzymol 47:45-148 (1978); P. Chou, "Prediction of protein structural classes 
from amino acid compositions," in Prediction of protein structure and the 
principles of protein conformation (G. Fasman. G.D. ed.). Plenum Press, New 
25 York, N.Y. 549-586 ( 1 990)) or Gamier. Osguthorpe. and Robson (J. Gamier et 
al.. J. Mol Biol 120:97-120 (1978)). Additionally, several well-characterized 
proteins have been observed to contain up to three b a breaker amino acids within a 
similarly sized a-helical region of the protein (T. Creighton, "Conformational 
properties of polypeptide chains," in Proteins: structures and molecular 
30 properties. \Y.H. Freeman and Company, N.Y., 182-186 (1993)). Since in most 
a-heliccs there are 3.6 amino acids per complete turn, the 1 8 amino acid length was 
chosen in order to generate a-helical peptides which contained 5 complete turns. 
Moreover, the use of hydrophilic amino acids would be expected to yield peptides 
which are soluble in the cellular cytosol. 
35 Randomized 18 amino acid hydrophilic a-helical peptide libraries were 

svnthes^d usinn the synthetic oligonucleotide 5' TAC TAT AGA 1CT ATG 
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(VAN)r TAA TAA GAA TIC TGC CAG CAC TAT ?' (SEQ ID NO: 65). The 
complementary strand of the 90 base randomized oligonucleotide was generated 
by filling-in with Klenow using the oligonucleotide primer 5' ATA GTG CTG 
GCA GAA TTC TTA TTA 3' (SEQ ID NO: 66). After extension the resulting 
5 dsDNA was digested with Bgl II and EcoR I and ligated into the pLACl 1 
expression vector which had been digested with the same two restriction 
enzvmes. 



Generating a Randomized Peptide Library Containing the +/- Charge 
10 Ending Motif. Randomized peptide libraries stabilized by the interaction of 
oppositely charge amino acids at the amino and carboxy termini were generated 
according to the scheme shown in Fig. 9. To maximize the potential interactions 
of the charged amino acids, the larger acidic amino acid glutamate was paired with 
the smaller basic amino acid lysine, while the smaller acidic amino acid aspartate 
1 5 was paired with the larger basic amino acid arginine. To construct the randomized 
peptide libraries, the synthetic oligonucleotide 5' TAC TAT AGA TCT ATG 
GAA GAC GAA GAC (NNN), 6 CGT AAA CGT AAA TAA TAA GAA TTC 
GTA CAT 3 f (SEQ ID NO: 67) and the oligonucleotide primer 5' ATG TAC 
GAA TTC TTA TTA TTT ACG TTT ACG 3' (SEQ ID NO: 68) were used. 
20 After extension the resulting dsDNA was digested with Bgl II and EcoR I and 
ligated into the pLACl 1 expression vector which had been digested with the 
same two restriction enzymes. 



For all libraries of randomized oligonucleotides, N denotes that an 
25 equimolar mixture of the four nucleotides A, C, G, and T was used, and V 

denotes that an equimolar mixture of the three nucleotides A, C and G was used. 
The resulting libraries were transformed into electrocompetent ALS225 E. coli 
cells (Example I) under repressed conditions as described in Example II. 

30 Screening of Transformants to Identify Inhibitor Clones. Transformants 
were initially screened using the grid-patching technique to identify any that 
could not grow on minimal media as described in Example II when the peptides 
were overproduced. To verify that all the inhibitors were legitimate, plasmid 



WO 00/22112 




PCT/US99/23731 



69 



DNA was made from each inhibitory clone, transformed into a fresh 
background, then checked to make sure that they were still inhibitory on plates 
and that their inhibition w as dependent on the presence of the inducer. IPTG. as 
in Lxample II. 

Growth Rate Analysis in Liquid Media. Inhibition strength of the peptides 
was assessed by subjecting the inhibitory clones to a growth rate analysis in 
liquid media. Minimal or rich cultures containing either the inhibitor to be tested 
or the relevant vector as a control were diluted to an initial OD550 of 
approximately 0.01 using new media and induced with 1 mM IPTG. OD550 
readings were then taken hourly until the cultures had passed log phase. Growth 
rates were determined as the spectrophotometric change in OD550 per unit time 
within the log phase of growth, and inhibition of the growth rate was calculated 
for the inhibitors using the appropriate vector as a control. 

RESULTS 

Isolation and Characterization of Inhibitor Peptides that are Fused at Their 
Carboxy Terminal End to the Amino Terminal End of the Rop Protein. 

20 Approximately 1 0,000 peptides protected by the Rop protein at their carboxy 
terminal end were screened using the grid-patching technique described in 
Example II, and 16 two day inhibitors were isolated. The inhibitory effects were 
determined as described in the Example II, using pRop(C) as a control. Unlike 
the anchorless inhibitors identified in Example II that were only inhibitory on 

25 minimal media, many of the Rop fusion inhibitors were also inhibitory on rich 
media as well, which reflects increased potency. As indicated in Table 15, the 
inhibitors inhibited the bacterial growth rate at levels that averaged 90% in 
minimal media and at levels that averaged 50% in rich media. The data in Table 
15 is the average of duplicate experiments. 

30 



10 
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Table 15: Inhibitory effects of peptide inhibitors stabilized by fusing the 
carboxy terminal end of the peptide to the amino terminal 
end of the Hop protein (Rop(C) fusion peptide inhibitors 



Inhibitor 


% inhibition in 


°o inhibition in 




minimal media 


rich media 


PRop(C')I 


87 


47 


PRop(C)2 


99 


58 


PRop('C)3 


85 


54 


PRop(C)4 


98 


49 


PRop(Cj5 


95 


54 


PRop(C)6 


99 


46 


PRop(C)7 


91 


59 


PRop(C)8 


86 


51 


PRop(C)9 


93 


57 


PRop(C)10 


91 


35 



5 



Isolation and Characterization of Inhibitor Peptides that are Fused at Their 
Amino Terminal End to the Carboxy Terminal End of the Rop Protein. 

Approximately 6000 peptides protected at their amino terminal end by Rop 
10 protein were screened using the grid-patching technique described in Example II, 
and 14 two day inhibitors were isolated. As observed for the Rop fusion 
peptides isolated using the p-Rop(C) vector, most of the inhibitor peptides 
isolated using the p(N)Rop- vector were inhibitory on rich media as well as 
minimal media. The inhibitors were verified as described hereinabove and 
15 subjected to growth rate analysis using p(N)Rop- as a control in order to 

determine their potency. As indicated in Table 16, the inhibitors inhibited the 
bacterial growth rate at levels that averaged 90% in minimal media and at levels 
that averaged 40% in rich media. The data in Table 16 is the average of 
duplicate experiments. 

20 
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Table 16: Inhibitory effects of peptide inhibitors stabilized by fusing the 
amino terminal end of the peptide to the carboxy terminal 
end of the Rop protein (Rop(N) fusion peptide inhibitors) 



Inhibitor 


0 o inhibition in 


°o inhibition in 




minimal media 


rich media 


pRop(N)l 


81 


30 


pRop(N)2 


c >6 


53 


pRop(N)3 


95 


43 


pRop(N}4 


92 


38 


pRopCNMS 


99 


"> 


pRop<N>6 


93 




pRop(N)7 


87 




pRop(N)8 


01 


44 


pRop(N)9 


95 


3 7 


pRop(N)10 


96 


40 



Isolation and Characterization of Anchorless Inhibitor Peptides Containing 
Two Prolines at Both Their Amino Terminal and Carboxy Terminal Ends. 

Approximately 7500 peptides were screened using the grid-patching technique 
10 described in Example II, and 12 two day inhibitors were isolated. As indicated 
in Table 1 7, the top ten inhibitors inhibited the bacterial growth rate at levels that 
averaged 50% in minimal media. The inhibitory effects were determined as 
described in the text using pLAC 1 1 as a control. The data in Table 1 7 is the 
average of duplicate experiments. 

15 
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Table 17: Inhibitory effects of peptide inhibitors stabilized by two 
proline residues at both the amino and carboxy terminal ends of the peptide 



Inhibitor 


% inhibition in 
minimal media 


pProl 


M) 


pPro2 


A O 


pPro3 


c r\ 
MJ 


prro4 




pPro5 


52 


pPro6 


93 


pPro7 


54 


pProS 


42 


pPro9 


41 


pProlO 


42 



5 Sequence analysis of the coding regions for the top ten inhibitors is 

shown in Table 19. The landmark Bgl II and EcoR I restriction sites for the insert 
region are underlined, as are the proline residues. 

Since the ends of the oligonucleotide from which these inhibitors were 
constructed contained Bgl II and EcoR I restriction sites, the oligonucleotide was 

1 0 not gel isolated when the libraries were prepared in order to maximize the 
oligonucleotide yields. Because of this, three of the inhibitory clones, pPro2 s 
Ppro5, and pPro6 were found to contain deletions in the randomized portion of the 
oligonucleotide. 



15 



Table 18: 
peptides 



Sequence analysis of the insert region from the proline 
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K I P F L I I V S G I t O o h i. r. _ r - 

AA7 T3T CAT GTT TGA ( SEQ ID N3 : 71: 
N S n V * (SEQ ID NO: 72; 

5 

pPro3-8aa 

A_3A TCT ATC CCG CCG GAC AAT C3G GTC CTG TGA TGA A3: G 3.--. G^T CCA CCA A3 3 GGA 7 A". CA3 C-G CC- 7 A.-. T/---. 
1 0 GAA 77C ( £ CO ID ND: 7 3 ) 

pPro4 - 9aa 

AGA TCT ATG CCG CCG CTA TTG GAC GGA GA7 GA3 AAA. TAG ATA TAT GCG TG3 TTC TTT TTC TGT C3G CCu- T.-_-. T.~--. 

X f F L L V G D l Y * i £ E ~ ID NC: 7 £ : 
15 GAA T73 (SEQ 13 NO: 75 J 

pPro5 - lOaa 

AGA. TCT ATG CCG CCG AGG TGG AAG ATG TTG ATA AGA CA3 TGA CAo ATo CG? TCC ATT AC. CCC G^ G..~. A^ 
M p P R W K M L I F. 0 * (SEQ ID NO: 75! 
20 ATT C (SEQ ID NO: 77) 

pPro6 - 7aa 

AG A TCT ATG ATG AGA GTA GCG CCG CCG 7AA TAA GAA TTC (SEC ID NO: 7 9; 
25 >: M R V A I F * (SCO 13 Nj: EC; 

pPro7 - 14aa 

AGA TCT AT 3 CCG CCG TTG CGC GGG GCA TGC GAT GTA TAT GGG GTA AAT TGA ATG TCT TGT GG3 CCG CCG TAA TAA 

M F P l R G A C D V Y G V N * <SEQ ID ND: 82) 

30 GAA TTC (SEQ ID~NO: 81) 

pPro8 - 21aa 

AGA TC^ ATG CCG CCG GGG AGA GGG GAA GCG GTG GGA. GTG ACA T3C TTG AGC GCG AAC GTG TAC CCG CCG TAA T.A 
M P F G R G E A V G V T C L S A N V Y P P * 

35 (SEO ID NO: bA ) 

GAA TTC (SEO ID NO: 82) 

pPrc»9 - 21aa 

AGA TCT ATG CCG CCG GGA AGG GTA GTG TTC TTT GTC GCT ATC TTT GTT TCC GCA ATA TGC CTC CCG CCG TAA ,«A 

40 mppgRVVFFVAIFVSAICLPP* 

(SEQ ID NO: 86) 
GAA TTC (SEQ ID NO: 85) 

pProlO - 21aa 

45 AGA TC- ATG CCG CCG AGG TTC GCT CAT GAG AGT GTT AAA GGG CTG GGG GAC GTT ACA AAA GCT CCG CCG ^ 
• MPPR FAHESVKGLGDVTKArr* 

{SEQ ID NO: 86) 

GAA TTC t SEQ ID NO: 87} 

50 All the inhibitors were found to contain two proline residues at either their amino 
or carboxy termini as expected. Four inhibitors contained two proline residues at 
both their amino and carboxy termini, five inhibitors contained two proline 
residues at only their amino termini, and one inhibitor contained two proline 
residues at only its carboxy terminus. 
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Isolation and Characterization of Anchorless Hvdrophilic Inhibitor 
Peptides Stabilized by an a-Helical Motif. Approximately 12.000 peptides were 
screened using the grid-patching technique and 5 two day inhibitors were isolated. 
5 The inhibitors were verified as already described for the Rop-peptide fusion studies 
and subjected to growth rate analysis using pLACl 1 as a control in order to 
determine their potency. As indicated in Table 19, the inhibitor peptides inhibited 
the bacterial growth rate at levels that averaged 50% in minimal media. The 
averaged values of two independent determinations are show n. 

10 

Table 19: Inhibitory effects of the hvdrophilic a-helical peptides 



Inhibitor 


% inhibition in 
minimal media 


pHelixl 


67 


pHelix2 


46 


pHelix3 


48 


pHelix4 


45 


pHelix5 


42 



Sequence analysis of the coding regions for the 5 inhibitors is shown in 
1 5 Table 20. The landmark Bgl II and EcoR I restriction sites for the insert region are 
underlined. Since the ends of the oligonucleotide from which these inhibitors were 
constructed contained these restriction sites, the oligonucleotide was not gel 
isolated when the libraries were prepared in order to maximize the oligonucleotide 
vields. Because of this, two of the inhibitory clones, pHelix2 and pHelix3, were 
20 found to contain deletions in the randomized portion of the oligonucleotide. The 
predicted a-helical content of these peptides is indicated in Table 20 according to 
the secondary structure prediction rules of Gamier, Osguthorpe. and Robson (J. 
Gamier et ah. J. Mo I. Bio!. 120:97-1 20 (1978)) prediction Riles. 
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Table 20: Sequence analysis of the insert region from the hydrophilic a- 
helical peptides 

pHelixl - IBaa, 83% a-helical 



10 



pHeli>;2 - 22aa, 68% a-helical 



15 A"J. T TT AT 3 TAG CAG GAG CAT GAG CAA GCC AGG A - \j AGC A.-. 3 A3- AT.", AAG AAT AA.T AAT AAT TCT 



pHelixB - 22aa, 55% a-helical 



AGA TCT ATG AAC CAT TAT AAT GA.G GCC ATG ATG AAC AC 



"AT TCT CAT 



25 ~~ M N H H N E A M I N T M K T k N N K N S H 

GTT TGA (SE'J I D NU: a 3) 
V * (SEC' It' NO: V. : 



30 pHeli>:4 - 18aa, 17% a-helical 

AGA TCT ATG AAC GAT GAC AAT GAG CAA GAG GAT AAT CAT GAT CAG TAT AAG GA.T AAC AAA TAA TAA GAA 

M N r b N 0 L> E E t- H b 0 H K D N K (SEQ 

ID MO: 96) 
35 TTC iSZQ ID NO: 93; 



pHeiix5 - 18aa, 50% a-heiical 



40 AGA TCT ATG CAA GAG CAG GAT TAG CAT AAT GAT AAC CAT CAT uAG GAT AAA TAT AAo AA-.j TAA TAA. GAA 
^ r : r -,j I C H H l< U H H E I 1 K H K K * * (SEQ 

i:.: NO: 98 



45 

According to Gamier, Osguthorpe. and Robson secondary structure prediction, all 
of the encoded peptides are expected to be largely a-helical except for pHelix4. 
Interestingly, pHelixl which had the highest degree of a-helical content was also 
the most potent inhibitory peptide that was isolated in this stud)'. 

50 
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Isolation and Characterization of Anchorless Inhibitor Peptides Stabilized 
by an Opposite Charge Ending Motif. Approximately 20.000 peptides were 
screened using the grid-patching technique and 6 two day inhibitors were isolated. 
The inhibitors were verified as already described for the Rop-peptide fusion studies 
and subjected to growth rate analysis using pLACl 1 as a control in order to 
determine their potency. As indicated in Table 2 1 . the inhibitor peptides inhibited 
the bacterial growth rate at levels that averaged 50% in minimal media. The 
averaged values of two independent determinations are shown. 

Tabic 21: Inhibitory effects of peptide inhibitors that are stabilized by the 
opposite charge ending motif 



Inhibitor 


% inhibition in 
minimal media 


P+/-1 


41 


P+/-2 


43 


P+/-3 


48 


P+/-4 


60 


P+/-5 


54 


P+/-6 


85 



Sequence analysis of the coding regions for the six inhibitors is shown in 
Table 22. The landmark Bgl II and EcoR I restriction sites for the insert region 
are underlined. With the exception of pV-4 which was terminated prematurely, 
the coding regions for the inhibitors were as expected based on the motif that 
was used to generate the peptide libraries. 
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Tabic 22: Sequence analysis of the insert region from the opposite charge 
ending peptides 



5 p+/-l - 2 5aa 



10 



15 



20 



P+/-2 - 25aa 



|A T "T ATG GAA GAC GAA 

M ~ L E 

:T A/---. CGT AAA. TAA TAA 

K P H 4 * 



P+/-3 - 25aa 



GGT CTA GGC ATG GGG GJ 
G L G M 3 G 
TT_r (SEO I - N 0 : 101 ■ 



:gc- t:g 



AG CTC AC? TTA 



A 3A TCT ATG GAC GAA 

" M E D E 

.5 CGT AAA CGT AAA TAA TAA 

R K R K 



GAC GGG GAG AGG ATC CAC 

:■■ j e r i j 

GAA TTC (5EQ ID NO: 10 j 

{ SEQ ID NO: 10-1 ) 



CTG GTA GAT 
L Y L 



30 



P+/-4 - llaa 



AGA TCT ATG GAA GA.C -GAA GAC GAC AGG GGG CGT GGG CG 

M E L £ D l; R G P. G R 
CGT AAA CGT AAA TAA TAA GAA TTC { SEQ ID NO: ICC; 



; CTT TA.-. GTT GGG CTA AGT TGG GAG AT 
;CEG ID NO: ICG) 



35 p+/-5 



Jbaa 



AGA TCT ATG GAA GAC GAA GAG GGG GGG GCC GGG AGG AC 

M E D E 0 G G A G P r 

CGT AAA CGT AAA TAA TAA GAA TJTG (SEC ID NO: 1 C 7 : 

40 R K K K • (GEO- ID NO: 106) 



TGI CTT TG 
L C 



3CG CTT GT 



P+/-6 - 25aa 



45 



AGA TCT ATG GAA 

M r. 
CGT A/A CGT AAA 
R K P K 



ga: AAG CGT CGC 
:■ k ?. R 

TGT (SEO ID NO: 



50 DISCUSSION 



In Example II. where fully randomized peptides were screened for 
inhibitor}' effect, only three peptides (one "anchorless" and two unanticipated 
Rop fusions resulting from deletion) were identified out of 20.000 potential 
55 candidates as a potent (i.e.. two day) inhibitor of E. call bacteria. Using a biased 
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synthesis as in this Example, it was possible to significantly increase the 
frequency of isolating potent growth inhibitors (see Table 23). 



Table 23: Summary of the frequency at w hich the different types of 
5 inhibitor peptides can be isolated 



Type of inhibitor peptide 


Frequency at which a two day 
inhibitor peptide can be isolated 


Reference 


anchorless 


1 in 20.000 


Example II 


protected at the C-terminal 
end via Rop 


1 in 625 


This example 


protected at the N-terminal 
end via Rop 


1 in 429 


This example 


protected at both the C- 
terminal and N-tcrminal end 
via two prolines 


1 in 625 


This example 


protected with an a-helix 
structural motif 


1 in 2,400 


This example 


protected with an opposite 
charge ending motif 


1 in 3,333 


This example 



Many more aminopeptidases have been identified than carboxypeptidases 
in both prokaryotic and eukaryotic cells (J. Bai, et ah, Pharm. Res. 9:969-978 

10 (1992 ); J. Brownlees et ah, J. Neurochem. 60:793-803 (1993); C. Miller, In 
Escherichia coli and Salmonella typhimurium cellular and molecular biology, 
2nd edition (Ncidhardu F.C. ed.), ASM Press, Washington, D.C 1:938-954 
( 1996)). In the Rop fusion studies, it might have therefore been expected that 
stabilizing the amino terminal end of the peptide would have been more effective 

15 at preventing the action of exopeptidases than stabilizing the carboxy end of the 
peptides. Surprisingly, it was found that stabilizing either end of the peptide 
caused about the same effect. 

Peptides could also be stabilized by the addition of two proline residues 
at the amino and/or carboxy termini, the incorporation of opposite charge ending 

20 amino acids at the amino and carboxy termini, or the use of helix-generating 
hydrophilic amino acids. As shown in Table 23, the frequency at which potent 
inhibitor peptides could be isolated increased significantly over that of the 
anchorless peptides characterized in example II. 
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These findings can be directly implemented to design more effective 
peptide drugs that are resistant to degradation by peptidases. In this example, 
several strategies were shown to stabilize peptides in a bacterial host. Because 
the aminopeptidases and carboxypeptidascs that have been characterized in 

5 prokaryotic and eukaryotic systems appear to function quite similarly (C. Miller, 
In Escherichia coli and Salmonella typhimuriam cellular and molecular biology, 
2nd edition (T\ T eidhardt, F.C. ed.), ASM Press, Washington, D.C. 1 :938-954 
(1996); N. Rawlings et ah, Biochem J. 290:205-218 (1993)), the incorporation of 
on or more of these motifs into new or known peptide drugs should slow or 

10 prevent the action of exopeptidases in a eukaryotic host cell as well. 

Sequence Listing Free Text 

SEQIDNO:2 

15 peptide sequence having opposite charge ending motif 

SEQIDNOs:3-4 
stabilized angiotensin 

20 SEQ ID NOs:6-19, 24-28, 55-58, 60, 62, 64, 66, 68 
primer 

SEQIDNOs:20-22 
primer fragment 

25 

SEQ ID NOs:23, 59, 61. 63, 65. 67 
randomized oligonucleotide 



SEQIDNOs:29-33 
30 antisense oligonucleotide 
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SHQ ID NOs:34. 36. 39, 40. 43. 45. 46. 48, 51. 52 : 70. 72. 74. 76. 78. 80. 82, 84. 
86. 88. 90. 92, 94. %. 98, 100, 102. 104, 105. 10S, 110 
stabilized peptide 

5 SEQ ID NOs:35, 37, 38. 41, 42, 44. 47, 49, 50. 53, 69. 71. 73, 75. 77, 79, 81, 83. 
85. 87. 89. 91, 93. 95, 97. 99, 101. 103. 106, 107. 109 
nucleic acid encoding stabilized peptide 

SEQIDNO:54 
10 N-terminal protective sequence 

The foregoing detailed description and examples have been given for 
clarity of understanding only. No unnecessary limitations are to be understood 
therefrom. The invention is not limited to the exact details shown and described, 
1 5 for variations obvious to one skilled in the art will be included within the 
invention defined by the claim. 
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WHAT IS CLAIMED IS:. 

1 . A recombinant method for identifying a bioaetive peptide comprising: 

(a) transforming a host cell with an expression vector comprising a 
tightly regulable control region operably linked to a nucleic acid sequence 

5 encoding a peptide; 

(b) growing the transformed cell under conditions that repress 
expression of the peptide; 

(c) inducing expression of the peptide in the transformed host cell; 

(d) determining whether expression of the peptide is inhibitory of host 
10 cell growth, wherein inhibition of host cell growth is indicative of the expression 

of a bioaetive peptide. 

2. The method of claim 1 wherein the tightly regulable control region of the 
expression vector comprises at least a portion of the wild-type E. coli lac 

15 promoter/operator region, said portion comprising auxiliary lac operator 03, a 
CAP binding region, the -35 lac promoter site, the -10 lac promoter site, lac 
operator Ol , lacZ Shine-Dalgarno sequence and a spacer region; and wherein the 
transformed host cell comprises an amount of Lac repressor protein effective to 
repress expression of the peptide during step (b). 

20 

3. The method of claim 2 wherein the host cell is a bacterium. 

4. The method of claim 3 wherein the bacterium is a gram positive bacterium. 
25 5. The method of claim 3 wherein the bacterium is gram negative bacterium. 

6. The method of claim 3 wherein the bacterium is E. coli. 



7. The method of claim 2 wherein the host cell is a microbial pathogen. 

30 
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8. The method of claim 7 wherein the microbial pathogen is a member of a 
genus selected from the group consisting of Streptococcus, Staphylococcus and 
Entcrococcus. 

5 9. The method of claim 2 wherein the expression vector comprising the nucleic 
acid sequence encoding the peptide is a first expression vector, and wherein the 
host cell is further transformed, prior to step (b), with a second expression 
vector comprising a promoter operably linked to a gene encoding a Lac repressor 
protein. 

10 

10. The method of claim 2 wherein the expression vector has the identifying 
characteristics of pLACl 1 (ATCC No. 207108). 

1 1 . The method of claim 10 wherein the expression vector is pLACl 1 (ATCC 
15 No. 207108). 

12. The method of claim 1 wherein the host cell comprises proteases or 
peptidases or both. 

20 13. The method of claim 1 wherein the host cell has not been modified to reduce 
or eliminate the expression of naturally expressed proteases or peptidases. 

14. The method of claim 1 wherein the host cell is a prokaryote. 

25 15. The method of claim 1 wherein the host cell is a microbial pathogen. 

16. The method of claim 15 wherein the microbial pathogen is a member of a 
genus selected from the group consisting of Streptococcus. Staphylococcus and 
Entcrococcus. 

30 

17. The method of claim 1 wherein the host cell is a eukaryotic cell. 

18. The -jnethod of claim 17 wherein the eukaryotic cell is a mammalian cell. 
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19. The method of claim 17 wherein the eukaryotic cell is a cancer cell. 

20. The method of claim 1 wherein the host cell is a protozoan. 

5 

21 . The method of claim 1 wherein the peptide comprises a first stabilizing 
group comprising the N-terminus of the peptide and a second stabilizing group 
comprising the C -terminus of the peptide. 

10 22. The method of claim 21 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein. Pro-, Pro-Pro-, Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 
consisting of a small stable protein, -Pro, -Pro-Pro, -Pro-Xaa and -Pro-Pro-Xaa. 

15 23. The method of claim 22 w herein the small stable protein is selected from the 
group consisting of Rop protein, glutathione sulfotransferase, thioredoxin, 
maltose binding protein and glutathione reductase. 

24. The method of claim 1 wherein the peptide comprises a stabilizing motif. 

20 

25. The method of claim 24 wherein the stabilizing motif comprises a 
hydrophilic cx-helix motif. 

26. The method of claim 24 wherein the stabilizing motif comprises an opposite 
25 charge ending motif. 

27. The method of claim 1 wherein the peptide comprises a randomized amino 
acid sequence. 

30 28. The method of claim 27 wherein the peptide comprises a first stabilizing 
group comprising the N-terminus of the peptide and a second stabilizing group 
comprising the C-terminus of the peptide. 
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29. The method of claim 27 wherein the peptide comprises a stabilizing motif. 

30. A bioactive peptide comprising a first stabilizing group comprising the N- 
terminus of the bioactive peptide and a second stabilizing group comprising the 

5 C-terminus of the bioactive peptide. 

3 1 . The bioactive peptide of claim 30 wherein the first stabilizing group is 
selected from the group consisting of a small stable protein. Pro-, Pro-Pro-, Xaa- 
Pro- and Xaa-Pro-Pro-; and wherein the second stabilizing group is selected 
10 from the group consisting of a small stable protein, -Pro, -Pro-Pro, -Pro-Xaa and 
-Pro-Pro-Xaa. 

32. The bioactive peptide of claim 3 1 wherein the small stable protein is 
selected from the group consisting of Rop protein, glutathione sulfotransferase, 

15 thioredoxin, maltose binding protein and glutathione reductase. 

33. The bioactive peptide of claim 3 1 wherein the first stabilizing group is Pro- 
Pro- and the second stabilizing group is -Pro-Pro. 

20 34. The bioactive peptide of claim 30 wherein at least one of the first and 
second stabilizing groups comprises a small stable protein. 

35. The bioactive peptide of claim 34 wherein the small stable protein is a four- 
helix bundle protein. 

25 

36. The bioactive peptide of claim 34 wherein the small stable protein is 
selected from the group consisting of Rop protein, glutathione sulfotransferase, 
thioredoxin, maltose binding protein and glutathione reductase. 

30 37. The bioactive peptide of claim 36 wherein the small stable protein is Rop 
protein. 



38. The bioactive peptide of claim 30 which is an antimicrobial peptide. 
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39. The bioactive peptide of claim 30 which is a therapeutic peptide drug. 

40. A bioactive peptide comprising a plurality of sequential uniformly charged 
5 amino acids comprising the N-terminus of the bioactive peptide and a plurality 

of sequential oppositely charged amino acids comprising the C-terminus of the 
bioactive peptide. 

41. A fusion protein comprising a four-helix bundle protein and a polypeptide. 

10 

42. The fusion protein of claim 41 wherein the four-helix bundle protein is Rop 
protein. 

43. The fusion protein of claim 42 wherein the polypeptide comprises a 
1 5 bioactive peptide. 

44. The fusion protein of claim 42 wherein the four-helix bundle protein is 
covalently linked at its C-terminus to the N-terminus of the polypeptide. 

20 45. The fusion protein of claim 42 wherein the four-helix bundle protein is 
covalently linked at its N-terminus to the C-terminus of the polypeptide. 

46. A polypeptide comprising: 

a bioactive peptide comprising (a) a first stabilizing group selected from 
25 the group consisting of a small stable protein, -Pro-, -Pro-Pro-, -Xaa-Pro- and - 
Xaa-Pro-Pro- and (b) a second stabilizing group selected from the group 
consisting of a small stable protein. -Pro, -Pro-Pro, -Pro-Xaa and -Pro-Pro-Xaa; 
and 

a cleavage site immediately preceding the first stabilizing group; 
30 wherein the second stabilizing group comprises the C-terminus of the 
polypeptide. 
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47. A polypeptide comprising: 

a bioactive peptide comprising (a) a first stabilizing group selected from 
the group consisting of Pro. Pro-Pro-, Xaa-Pro- and Xaa-Pro-Pro- and (b) a 
5 second stabilizing group selected from the group consisting of -Pro-, -Pro-Pro-, - 
Pro-Xaa- and -Pro-Pro-Xaa-; and 

a cleavage site immediately following the second stabilizing group; 
wherein the first stabilizing group comprises the N-terminus of the polypeptide. 

10 48. A polypeptide comprising: 

a bioactive peptide comprising a plurality of sequential uniformly 

charged amino acids comprising the N-terminus of the bioactive peptide and a 

plurality of sequential oppositely charged amino acids comprising the C- 

terminus of the bioactive peptide; and 
1 5 a cleavage site immediately preceding the plurality of sequential 

uniformly charged amino acids. 

49. A polypeptide comprising: 

a bioactive peptide comprising a plurality of sequential uniformly 
20 charged amino acids comprising the N-terminus of the bioactive peptide and a 
plurality of sequential oppositely charged amino acids comprising the C- 
terminus of the bioactive peptide; and 

a cleavage site immediately following the plurality of sequential 
oppositely charged amino acids. 

25 

50. A method for using an antimicrobial peptide comprising: 

covalently linking a first stabilizing group to the N-terminus of the 
antimicrobial peptide and a second stabilizing group to the C-terminus of the 
antimicrobial peptide to yield a stabilized antimicrobial peptide; and 
30 contacting a microbe with the stabilized antimicrobial peptide. 
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51 . The method of claim 50 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein. Pro-. Pro-Pro-, Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 
consisting of a small stable protein, -Pro, -Pro-Pro, -Pro-Xaa and -Pro-Pro-Xaa. 

5 

52. The method of claim 50 wherein the first stabilizing group is selected from 
the group consisting of Pro-, Pro-Pro-, Xaa-Pro- and Xaa-Pro-Pro- and the 
second stabilizing group is selected from the group consisting of -Pro, -Pro-Pro, 
-Pro-Xaa and -Pro-Pro-Xaa. 

10 

53. A method for using an antimicrobial peptide comprising: 

covalently linking a plurality of sequential uniformly charged amino 
acids to the N-terminus of the antimicrobial peptide and covalently linking a 
plurality of sequential oppositely charged amino acids to the C-terminus of the 
15 antimicrobial peptide to yield a stabilized antimicrobial peptide; and 
contacting a microbe with the stabilized antimicrobial peptide. 

54. A method for treating a patient having a condition treatable with a peptide 
drug comprising administering to the patient a stabilized form of the peptide 

20 drug. 

55. The method of claim 54 wherein the stabilized form of the peptide drug 
comprises a first stabilizing group comprising the N-terminus of the peptide 
drug and a second stabilizing group comprising the C-terminus of the peptide 

25 drug. 

56. The method of claim 55 wherein the first stabilizing group is selected from 
the group consisting of a small stable protein. Pro-. Pro-Pro-, Xaa-Pro- and Xaa- 
Pro-Pro-; and wherein the second stabilizing group is selected from the group 

30 consisting of a small stable protein, -Pro, -Pro-Pro, -Pro-Xaa and -Pro-Pro-Xaa. 



57. The method of claim 56 wherein the small stable protein is a four-helix 
bundle protein. 
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58. The method of claim 56 wherein the small stable protein is selected from 
the group consisting of Rop protein, glutathione sulfotransferase, thioredoxin, 
maltose binding protein and glutathione reductase. 

5 

59. The method of claim 55 further comprising, prior to administration of the 
stabilized form of the peptide drug, covalently linking the first stabilizing group 
to the N-terminus of a peptide drug and covalently linking the second stabilizing 
group to the C-terminus of the peptide drug to yield the stabilized form of the 

10 peptide drug. 

60. The method of claim 54 wherein the stabilized form of the peptide drug 
comprises an opposite charge ending motif. 

15 61. The method of claim 60 further comprising, prior to administration of the 
stabilized form of the peptide drug, covalently linking a plurality of sequential 
uniformly charged amino acids to the N-terminus of the peptide drug and 
covalently linking a plurality of sequential oppositely charged amino acids 
comprising the C-terminus of the peptide drug to yield the stabilized form of the 

20 peptide drug. 
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Set} iD ^jo^ 



toc7 
stop 



PGCAGTGAGCGCAACOHAATt | ^^ 

03 CAP 



-35 



SD 



\acZ 
start 



TGCTTCCGGCTCqTATGTlkTGTGG AATTGTGAGO 

-10 oi 




wo 
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3*1 n i 



BssH II 5215 
BspI20I50Il 
ApaI50l 1 
BslEH49S5 
BdMSIS 
MluI4S04 



7 



laclq 




NcoI437i y 
AatII4296_ 

Ssp 141 SO" 



ScaI3S56' 
Pvu 13745' 

PstI3619' 



lacPO 



pIAC22 
5652 bp 



Amp 



on 



EsaI3445 / 
Ahd I 3373' 



AlwNI2896 J 



Rop 



inD III 41 
s T hc I 24 1 

■amHI3S7 
.S°rA 142! 
,Sph I 574 
coNI634 
al I 663 
f shA 1 724 
JEagI951 
nj 19S4 
spMI1075 

3sml 1365 
-Ava 1 1437 
Msc I 1456 
^r^spMII 1676 
1 BspEI1676 
BS23I 1680 



\w-Tthin 12229 

vV ^BsaA 12237 
X XcaI2256 
>nal 2256 
Kde I 2307 
>apI2362 
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Pvu 113585 
AalI7 3495 
SspI3379 
Ear 1 3366^4. 




Bsa 12644 
Ahd I 2572 



HglEII2265 

AlwNI2095 



inD JH4I 

coR V 197 
he I 241 

arnHl3S7 
S^rA 1421 

spH I 574 
ph I 574 
^7524 1574 
coNI634 
ccI663 
all 663 
PshA I 724 
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ruI9S4 
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xaa 

xaa xaa xaa 
met glu" asp- glir asp" XQa XG ° xaa 

xaa 

Lys+ arg+ l ys + arg + 

xaa v _ xaa 

xnn xaa 
vnn xaa 
xaa xaa 
xaa 



fa ■ ^ 
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SEQUENCE LISTING 

<no> university or Georgia research foundation, inc. 

<120> STABILIZED B I OA ST I VE PEPTIDES AND b:tL7HODS OF 
IDENTIFICATION, SYNTHESIS AMD USE 

<I3D> 235 . 00010001 

<1 4 0'-* Unas signed 
<141 • 1999-10-13 

<15C - 60/104 , 013 
<151 • 1996-10-13 

<150> 60/112, 150 
<151> 1998-12-14 

<16C 110 

<170 ■ Patentln Ver. 2.0 

<210> 1 
<211 ■ 133 
<212> DMA 

<213> Escherichia coii 



<400> 1 

ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc ccaggcttta 60 
cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca atttcacaca 120 

133 



ggaaacagct atg 

<210> 2 
<211> 25 
<212 > PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
having opposite charge ending motif 

<4 00> 2 

Met Glu Asp Glu Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa X£ 

Xaa Xaa Xaa Xaa Xaa Arg Lys Arg Lys 

2 0 2 5 



1 



WO 00/22112 



PCT/US99/23731 



•• i - > 

^21- * - 4 

< 2 1 • Ariiiicial S e q u c^r.ce 



['-£cr:p:io:; of Artificial SoqjeriC: 
a:::ic:ens:i. 



e4 00: 
Pro p 
1 



sp Arc Yal Tyr ile Hi:' trc 



Frie Hi; 
10 



Tie ?r: Pre 



•:2 !(•:• 4 

18 

■ : 2 1 2 : ■ Artiiicicl Sequence 
-^i^O' - 

•:22 3 ■ Description of Artificial Sequence: stabilized 
angiotensin 

-:4G0-' 4 

Glu Asp Glu Asp Asp Arg Vai Tyr He His Pre The His He Arc Pys 
1 5 10 15 

Arq leys 



<210 > 5 
<211 - 10 
<:.12 - FRT 

• 2 ] 3 • Hot: sapiens 
4 0 0 • 5 

Asp Arq VH Tyr He His Pro PH<- His lie 



21 : .' 2- 
212- HNA 

2I3> Artificial Sequence 



WO 00/22112 



; :r 2 pt n o: hrtiri::^! Sequence: rr 
4 ■? C ^ ( 

:::ocitt^i c~csagqcac 

; 1 :: • 7 
:i: ■■ 4: 
_ 12 • d::a 

213- Artificial Sequence 
2 2 0 

_23 * Description of Artificial Sequence: rri:uc 

■;oo ■ 7 

t a a a 1 1 c a taagatcttt cctgtgtgaa attgttatcc g t 
: 1 0 ■ 8 

2 i : * 3^ 

212 ■ d::a 

213 • Artificial Sequence 



22? : Description of Artificial Sequence: primer 
-clOG-- 8 

jttaiiattca ccatggacac catcgaatgg tgcaaaa 

.210 • 9 
;211 - 19 
212 • DNA 

212 ■ Artificial Sequence 
2 2 1 ' 

22/ - Description of Artificial Sequence: primer 

■JOu 9 
o 1. 1 ■': *: t geca ttgctqcag 

10 
4 3 

d:;a 

Artificial Sequence 



Description of Artificial Sequence: prir.er 
10 

tgtatgaa-^ cccgggtacc atggtfgaag acgaaagggc :i: 



-.21'"' • 
-.21 i • 

■21^ > 
• 2 1 > - 
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3 l 

di;a 

Artificial Sequence 



22 — Descripricr. cf Artificial Sequence: primer 

4 

.-. r.«taga: ctataacca: gatcacgqat tcaetg 

21 — 12 
2 1i.* 36 

: i: • diia 

213 ■ Artificial Sequence 
j .20 ■ 

j23- Description cf Artificial Sequence: primer 
4 00/ 12 

arataaagc ttggcctgcc cggttattat tatttt 

.2 1:: > 13 
2 i 1 - 4 7 
-:2] 2 ■ DNA 

■213- Artificial Sequence 



^22:- - Description of Artificial Sequence: primer 
<.:rv? -13 

tjt.ritctgc agaggaaaca gctatgacca tgattacgga ttcactc ^ 1 

< 2 1 0 - 1 4 

• 2.11 > 4 7 

• 2i; • DNA 

•■■J;"- * Artificial Sequence 

■-22' ■ Description of Artificial Sequence: primer 

• 4 -.J" - 14 

t.u — tactco agcaggaaag cttagcctgc ccggttatta ttatttt 
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Art 1 1 icic- 2e :u-j: 



;. je5::ip: i :»r. c f Art ifici - 1 Sequ-:.c-: t n:te: 

4 00> 15 

it cat ca at ccagqaaaca grtatgacci toi:tacgga ttcacta 

210:- 16 
ill:- 36 

dm a 

/ll. Artificial Sequence 
220: 

<_-v Description of ArtiLicicil Sequence: pr liter 
4 00 ■ 16 

actatagat etatggctat cgacgaaaac aaacag 
2 : C • 17 

■ 4 0 

212 • DNA 

213 ■ Artificial Sequence 



1.23-- Description of Artificial Sequence: primer 
4 00 - 17 

tatataagc ttttaaaaat cttcgttagt ttctgctacg 
210 * 18 

j : : 35 

212 - DNA 

212 * Artificial Sequence 
220 > 

2^7 * Description of Artificial Sequence: primer 
400 - 13 

art^tagac ctatgaacaa aggtgtaatg cgacc 

::!■:.) - 19 

■211 > 2 5 

212 • CNA 

21?- Artificial Sequence 



Description cf Artificial Sequence: prirr.er 
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a«.;agia:ai ,;:a:a^tc t. c t q ■:: a a n a a q:c:: 



v ; 1 • Artificial Sequence 



<.:2 3;. Description of Artificial Sequence : prmer 
f raqr.ien: 

<. *; 0 0 ' - 2 o 

oOiitotiiaio aauc 

< 2 : o > 2i 

• ; 1 1 > 15 

> dna 

,.:■:•> Artificial Sequence 
O20 > 

-.2:2 > Description of Artificial Sequence: primer 
i ragment 

a q a tctv.atg a a 1 1 c 

• 2 1 0 > 2 

< 21 2> DMA 
2\:-> Artificial Sequence 

-:22 0:-> 

. ;_ j :-. description cf Artificial Sequence: primer 
fragment 

i ■;; atcttatg a a 1 1 c 

• 21' * 2':-. 
- 211.- 9 2. 

-.2:2.- di:a 

• 2 1 :• - Artificial Sequence 
-:22C> 

<22V> Description cf Artificial Sequence: rancoxi :eci 



15 



15 
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■tavcnrir.r.ri ru.r.r.rir.r.rir.n rircuirumnr-n r.r.ru. n n r. :i r. r. : i rente *c_' 

Uitnr. c 5 ^ r a a - a a ^ t :*: c j a c a 

< :■ 1 1 > :e 
• i:;a 

<:1 I 3> Artificial So quer.ce 



•a:13:- ■^^cr:p;ic:i c: Artificial Sequence : primer 

• •;c o.- 14 

iriic^j^ji tcftatid 

- ; m • ii 

•111: 10 
r>llA 

■:.:ll: Artificial Sequence 

• 120 • 

•'111 Description of Artificial Sequence: primer 
<.i;00 • 2 5 

* cattaatgc agctggcacg 

•:210 ■ 2 6 
■:211 • 2 0 
•112 • DNA 

■:2 13 • Artificial Sequence 
•:220 • 

■■:j2 3 ■ Description of Artificial Sequence: primer 
•000- 16 

ttcafaoacq gtgcctgact 

•110 ■ ."7 
■:21 1 > 10 

tin * d::a 

c 2 1 3 * /artificial Sequence 

• ■ 1 2 C- ' 

•'2 23' Description c: Artificial Sequence: primer 

• 40 ."• • 2 7 



t a t c t c a c t c a 1 1 a g g c a c c 
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<2 DKA 

■:j12^ Art: ::-:a: ^ec;en:- 



• :21'5> Descriy:iu:"i c; A: ::;ici al Se :uer.:r : prime: 
•:4 0j> IB 

gatgcccatg ac:gcattct 



:i0 
:1 1 



3' Artificial Sequence 

:223> Description of Ar::r :cia^ Sequence: antisense 
c 1 i aonucl eot ide 



■:400:- 29 

t^ctatagat ctaccgt :ac tgaattttgt ggcttgttgg accaactg; 
tgqaaqactg aaattaacaa gaattctcga ca 



ttagtaatag 60 
92 



'2:0 20 

•'2 11- 91 

<2 12 * DNA 

<2 13 • Artificial Sequence 
<2 20 • 

• ■' 2 ■ 3 • [lescription :> f Artificial Sequence: antisense 
oiigcnucieot ide 

<4 00 ■ 30 

tact at a gat ctacqtggcg gqactcatqg attaagggta gggacgtggg gcttatgggt 60 
taaantagtttgataataagaattctcgaca 91 



: :. ; c • ?1 

'211 ■ 92 

:212 • L'NA 

:213- Artificial Sequence 



[;ec:r:ptior. or .Artificial S e q u e n c e : a n t isensc 
cliaon u c iectia e 



WO 00/221 12 



PCT/US99 23731 



ta:^:^::;' rt " c / □ :> a :.: a^a :a^^t;:-^ 

g c t a : a c * u t a "tfiatc a a a a t ::: c :_: a c a 



c ^ c - 1. 5: : 



DNA 



212;- Artificial Sequence 



Des2r;r::cn cf Art: filial Sequence: an tiser.se 
c2i:or.uj]e:ticle 

tactatagat ctacggaccg tgaagtgatg tgtgcggcaa aacaggaatg gaaggaacga 60 
,j.\jc. jtaoj c c g c g t a a t c; a g a a 1 1 a t c a a a a 2 2 

- 2 : r • 32 

• 2] 1 :■ 22 

- 212. DNA 

•Ji3:- Artificial Sequence 

*\2 2 3a Description of Artificial Sequence: antisense 
oligonucleotide 

■ A 00 ■ 3 3 

tactatagat ctacgagggg cgccaactaa ggggggggga aggtatttgt cccgtgcata 60 
atctcgggtgttgtctaataagaattctcgaca 93 

■210 > 34 

-.■I'll • 13 

■■2 12 - FRT 

•213' Artificial Sequence 

■ 220 • 

•223- Description of Artificial Sequence: stabilized 
pept i de 

■ 4 02 ■ 34 

Met 7a i Tnr G2u Pre Cys Giy Leu Leu Asp Gin Leu Pro 
i 2 1 0 




6 6 
DKA 

Artificial S e gu e n ce 
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<2 22:- Oes:r:r::::i of Artificial Sequence: nucleic acid 
er,:co;r::; stabilized peptide 

• i : ;<.• 35 

cag^r-aaga: coat^tca: t g a a t t 1 1 g t gc;::q::q3 aceaactg 
t g a a a g g r c g a a a 1 t a a t a a a a a 1 1 c 

■ v< 36 
< :.: : ■ ic 

- 212 ■■ ?RT 

• 1:13:- Artificial Sequence 

• 220. > 

• 2 2--^ Cescr:p:iCo of Artificial Sequence: stabilized 

pept ide 

- -100 - 36 

M-.-t 'Irp Arg Asp Ser Trp lie I.ys Gly Arg Asp Vai Gly Phe Met Gly 
i 5 10 15 



1 1 a g t a a t a g 6 0 
be 



-..:10.- 37 

<.:il> 8 5 

v.'.12> DMA 

13-- Artificial Sequence 

<220'> 

<22 3> Description cf Artificial Sequence: nucleic acid 
encoding stabilized peptide 

•■4 00 > 37 

caggaaagat ctatgtggcg ggactcargg attaagggta qggaegtggg gtttatgggt 60 
raaaatagtttgataataagaattc 8 5 

•'210 > 38 
-J 11.* 14 1 
-212 ■ DMA 

■213- Artificial Sequence 
•■220o 

•22;-' Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<4 0: > 31 

caggaaagat ctatgtcagg ggqacatgtg acgagggagt geaaqtegge gatgtccaat 65 

cgttqgatct aegtaataag aattctcatg tttgacagct tateatcgat aagctttaat 12;: 

gcggtsgttt at c acagttaa 
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s i i . ' * ± 

■ Fr.'i 

213'' Aru fi- 




< 4 GO > 3 2 

Met Ser GI y Gly H i : - Val Thr Arc Glu Cy? Lys Ser Ala M- t 5 or" Asn 
Arj j i p lie 1 yr V^i j. - e 




lie 



Asm Ala Va 



Tyr 

A 0 



oi:> 6 

"'.:12:- PRT 

13> Artificial Sequence 

• 220> 

•223:- Description of Artificial Sequence: stabilized 
pept ide 

'■•ICC ■ 4 0 

Met Tyr Leu Fhe He Gly 
1 5 



•210- 4 1 
■211 ■ 75 

- 212 > d::a 

• 212 * Artiricial Sequence 



riptior; of Artificial Sequence: nucleic acid 
uinc stabili :>=:i peptaae 

v A , . 4i 

cagcaaagat ctatgtattt gttcatcgga taatacttaa tggtccgc~g gagaacttcf: 62 
otttaataacaattc 



Desc. 
e n c o 
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: 1 . ■ ■ 

: : : • v " 

: 1 2 • Ai z : f i a I Soquer 



*:2 2 3> jescripticn of Artificial Sequence: nucleic aci'j 
encoding stabilized p£p:id'j 

cagcaaagac c:a:g::rct atttgggggg gactqcgg::r a gaaagccg "i atactttact 6( 
c'.ajraccn: caagg:aa:a agaat:: 8 ~ 

•:2 : :• 4 2 

• 2 1 2.- FF.T 

•:_13:- Artificial Sequence 



• 2. ? . Description of Artificial Sequence: stabilized 
pieptide 

<AQC:- 43 

Met leu Leu Phe Giy Giy Asp Cys Giy Lys Ala Giy Tyr Pne Thr Vai 
1 5 10 lb 

Leu Pro Ser Arg 
20 



■:j10.- 44 

■Oil:- 75 

•2 12. DNA 

-■J 1 3 ■ Artificial Sequence 

<220:- 

•'22 v- Description of Artificial Sequence: nucleic acid 
encDding stabilized peptide 

• 4 0 C ■• 4 4 

z-jgcaaacat ctatqattgc gggat. cgttg accttcgcct gggcaatagt ttgraataag 60 
na:;ctcatg tttga */ : 

•:210 ■ 4 b 

111- 2 C 

2 12 PPT 

212.' Artificial Sequence 



12 
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pcptice 
* 4 I : : • 4 

Met lie Gl y Gly S e r Leu Ser ?• h e Ala Trp A. a lie V-i 2'ys As:: l.ys 
1 ' 10 II 

/-.so ?er His Y a 1 



• 11C- 4 6 
^111- 1 4 

*1"13.' Artificial Sequence 
-.220 • 

•113 • Description of Artificial Sequence : stabilized 
pept ide 

v4 00 ■- 4 6 

Met Asn Gly Arg Thr Lys Arg lie Arg Asp Pre Pro Ala Ala 
1 5 1C 



<:2i0 > 47 

<211> 8 6 

<212:> DNA 

•-213 > Artificial Sequence 
'■■220 • 

22 3 • Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

*'400.- 4 7 

caggaaaqat etatgaaegg ccgaaccaaa egaatceggn acccaccagc cgcctaaaca 60 
■^cta zcaqct gtggt aat a a ga st tc S6 

4 K: 
IB 
PR? 

Artificial Sequence 
•;s2C* 

<223 > Description of Artificial Seq^er.ce: staril.zeo 
peptide 
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• 1 . ■ 4: 

1 i \j 



<:2il> £~ 

-:2i;:> d:;a 

-.J13 - Artificial Sequence 
v220> 

-.223.- 2 ^ s ; r i p t i o r. of Artificial So q u o i . c -_■ : nurloic i jij 
encoding stabilized peptide 

-400- ^0 

caggaaagat ctatcgaceg tgaagtqatri tgtgcggcaa aacaqgaat:: gaaggaacqa 60 
acgccatagg ccgcgtauta agaatt.c 6 7 

<210*- 50 
*:211 > 87 
<212> DNA 

• : 2 1 2 • Artificial Sequence 
<22 0 • 

<222.* Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<4 00 - 50 

oaqgaaag^t ctatgtagcc caatacactg ggaqcacgcg tgttaggtct agaagccacg 60 
tacc:attta atccataata agaattc 87 

•:21/ - 5 1 
till • 12 
•."211 • ??.T 

■■:2il.' Artificial Sequence 

•:223* Description of Artificial Sequence: stabilized 
pert ice 

■■:4J:- 11 

Met Leu Sly Leu Glu Aia Tnr Tyr Iro Pine Asn Pro 
1 b 10 



1 A 
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:12.- PS? 

. 1 ; . Artincia: t ~ ; ..en :•: 



peptide 
*:-;oo> 52 

Met ;.rq 01 y Ala Asn 
1 



■ 5 3 

1 1> 87 
et 1 / DMA 

•313: Artificial. Sequence 

* ' 1 ;. ."- • Description cf Artificial Sequence: r.ucleic acid 
encoding stabilized peptide 

v4(v , 53 

cagcaaaq a t ctatgaggcg cgccaaccaa ggggggggca aggtatctgt cccgtgcata 00 
at. ctcgggtg ttgtctaata agaattc 87 

5 4 
4 

FRT 

Artificial Sequence 

Description or Artificial Sequence: K-tormnai 
protective sequence 

< oo • 54 

a a Ore Pro X a a 



-:S1 
i 3 ■• 



5 5 

dda 

Artificial Sequence 
•:223..- Descriptic:: cf Artificial Sequent-- primer 
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r. r t" c 



a a;:ac 



. 1 J . * ~ o 



. 1 1 1 c l a - sequence 



L2; ■ De.^cr:ntior. of Artificial Sequence: prime r 



4 00 5 6 

.it.a^gtatt cagttgccca catgttcttt c 



ctcc 



3 6 



4 1 

DNA 

Arti i;cic 



<^_2i - Description of Artiiicial Sequence: prime i 
<AO0> 5 7 

a. a tt cat act atagatctat gaccaaacag gaaaaaaccg c 

<210.> 5 8 
<21:,' 4 2 
<:212 • DNA 

0213 - Artificial Sequence 



Description of Artificial Sequence: primer 



v 4 00 • 5 -3 

Utataatac atgtcagaat tcgaggtttt caccgtcatc a c 



4 2 



21 j. - 5 j 
21] - 96 
:2)2 > DNA 

'21^- Artificial Sequence 



: 2 2 u 



Description; :f Artificial Sequer. 
o. ico nucleotide 



ranaomi ze; 



< 4 'J 2 . > t j 

tactataaat ct ator.nnnn nnnnnnnnnn r.nnnnnnnnn nnnnnnnnr.n nnnnnnnnnn 



16 
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rmr.nr nr. nrconr. ca c ac at;:* tgrga: 

•.: ; ; t. • 

• i : : i 

■::i: DIjA 

1 .- Artificial Sequence 



.13 1 Poscription cf Artificial Sequence: trie, or 
:4C0 * CO 

itoaeaacac geagatccat g 



• :::a 

:l.v- Artificial Sequence 



1 • Inscription o: Artificial Sequence: raricion\i cec 
oligonucleotide 

,'4Q[V-. 

t a-ctataaat tcnnngaatt ctcccaccac tact at 36 
21 

Cii/ UNA 

-~ 21 3> Artificial Sequence 
< 2 2 0 • 

-'22~:>'- I'oscripticn of Artificial Sequence: primer 
'■400:- £2 

at act a g egg tggcagaatt c 21 

t 3 
105 
L-!IA 

Art i f icia 1 Sequence 



i : - 

vlll :• 



- : 2 2 '• loscriptior: cf Artificial Sequence: ran tier.; ze ;i 
o 1 i g c n u c 1 e o t i d e 

< 4 0 > C 3 

tactatagat ccatccegcc gnnnnnr.nnn r.nnnnnnnr.r. r.r.nrcr.r.r.r.r. n nr. nnnnr.nn € C 
n n n n r i r i n n n n n n n r. n n n n n r. r. c c a c c c t a a t a a g a a 1 1 c g t a c a t i r : 2 
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■'ii:-:- 6 

•hi li;a 

<2il> Art i : i c : - - S-Tj-n:^ 

■ ' 2 2 C : • 

■22 2:- l-'-e script i on of nru:;c;al Sequer. co : r,r in.er 

■ 4 02 - 6 4 

atqt. jcoaat: tc:tr.tta::: a o g ~: 

• 21'. 25 
-21:^ 90 

■ 21.'. ■ UNA 

• 2 " • 2r::f:c:ai Sequence 



■ '21 2 • inscription of Artificial Sequence: r ari o :ra zee: 

oligonucleotide 

-• 4 C o ■ 6 5 

t^ct jtagat ctatqvanva n van van van vanvanvanv anvanvanva nvanvanvan 60 
vanvantaat aaqaattctci ccagcactat 90 

oi: 0 * 66 
-'2il> 24 
•■'212 • PKA 

- 2 1 2 ■■ Artificial Sequence 
■ .' 2 2 ; "i "■ 

• 2 2 ' J i ' ■ 2e scription of Artificial Sequence: prmer 

• ■•10'.' 1 • 6 6 

atagtgeteg cagaattctt atta 2 4 

■21m . t.7 
-111 ■ 105 

■ 2: 2 • L2JA 

0112 • Artificial Sequence 
•:220 ■ 

•22 3- Descriptior. of Artificial Sequence: r an eforo 1 ceci 
cl iainucieetice 

<4G0.> 67 

tactatagat otatgaaaga egaagaennr. nnnnnr.rinrin nnnnnnnnnn nnnnnnnnnn 61 
n n nr.n n n : 1 n ri ri n n nr. cat a a a c i r. a a a t a a taa a a a 1 1 c g t a cat 1 0 5 
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2 1 0 > 

: : : . 3 : 

in spa 

1 1 lo ■ Aniricicl Sequence 



«.:21 . U;cr:p^:cn cf Artificial Sequence : p r;:i-r 

srcurga^L :ct:ai.:a:t Lacgttiac; 30 

• : : . • i ::a 

• : j 1 3 Artificial Sequence 

• ..:20> 

v.:2?s- Description of Artificial Sequence : r.i:Ieic acid 
e n c c- ding stabilize d peptic e 

-MOO:- 69 

agatctatgc cgccgattct atggggcqaa gcgagaaag: gcttgtggqc t.cgggatcat 60 
d::aco::c:t aaraaqaart : 6- 

10> 7 0 

Oil:* 21 

•:212> PRT 

•■1 13:* Artificial Seouence 



2 3'- Description or Artificial Sequence: stabilized 
peptide 

• -1 0 J > 7 : 

Met Pro Pro lie Leu Trp Giy Glu Ala Arg Lys 7-.ro Leu Trp Gly Gly 
1 3 13 13 

7o-p His r inr Pro Pro 
20 



j. . t - 

111 ■ 9 0 

2 12 > si: a 

211- Artificial Sequence 



WO 00/221 12 PCT/US99. 23731 



■2'.- C-35':r; p t i . r. ; : Am. Seq';en:-: : rru ui e ; r a : c 

encoding otanliooc pe:tide 



a'jatciat:; cc:::a:::a ::a:j:tcrq rcggg^airc a:;j:a::'n; j c a t T t q t c :: 2 ; 
ta:cg:c^a ttaa^aar:: ::^tq::taa p. 

-r ■' 1 1'' » 7 2 

•:211 ■ 1 ~ 

•712 - PPT 

•2 17 - Artificial Sequence 



•223- Description of Artificial Sequence: stabilized 
peptide 

• .;Ci- 72 

Met. Pro Pro Pro Leu Asp lie 7-1 Ser Giy lie G .; u Va ; G 1 y Giy Hi: 

1 2 10 2 5 

Leu Trp Cys Arg Arg lie Lys Asm Ser His Va 1 
20 25 



210 - 73 

2 1 r-> si 

■ 7 1 2 : - DNA 

.213:* Artificial Sequence 
220-' 

-2 2 .': •• Description of Artificial Sequence: nucleic acid 
encoding statilized peptide 



- 4 07 • 7 3 

aqatctatgc cgccggacaa tccggtcctg tgatgaagcg gaggtcgacc a^ggggatat 60 
c a g c c g c c g t a a t a a g a a 1 1 c £71 

• 210> 74 

•■211:- b 

•'212:- PPT 

'213:- Artificial Sequence 



Description cf .Artificial Sequence: stabilize:: 
peptide 



■ 7 u 0 > / 4 

Ket Pre Pro Asc Asn Pre '7a 1 Leu 



WO 00/22112 




PCT/L 'S99/23731 



• 7 c . 
v2:i> e: 
< 2 : :. - dna 

•2 13> Arti::cia: Sequence 
■ 220;* 

« cl?. :• Description of Artificial Sequence: nucleic acid 
encoding; stabilized peptide 

*MO0> 7: 

aqatctaicc ccncgctatt ggaeggagat gacaaataga tatacgegtg gttgtttttc 60 
*_ g t c c g c e g c a a t ^ a a a a t t c ? 1 

<21C> 76 

•:211o 10 

• 2 12:- P?.T 

■:213o Artificial Sequence 
<2 20> 

-:223> Description of Artificial Sequence: stabilized 
pept ide 

<400> 76 

Mot Fro Pro Leu Leu Asp Gly Asp Asp Lys 
15 10 



-.'-210.- 7 7 

<211> 7 9 

<212> DNA 

v'213> Artificial Sequence 
<220 > 

■:223- Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<400> 77 

aqatctatg: c g c c :i a g g t q g a a g a t g 1 1 g a t a a g a c a g t g a c a g a t g c g 1 1 c c a 1 1 a c t € 0 
cccgccgtas taaqaattc 1 ? 

-:21G- 76 

<:2ir> 11 

<2I2^ PRT 

<213> Artificial Sequence 



WO 00/22112 




PCT/US99 23731 



jes:r:.c:icn of nr:i: lcicl Sequence: stabilized 



< : A C ■ 

Met 
I 



•.rg Trp Lys 



lie Arg 
10 



;ir: 



7 9 
i ^ 
DUA 

Artificial Sequence 



Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

a 'j a t c t a t g a tgagagtagc gccgccgtaa taagaattc 39 

<210> 8 0 
<tli> 7 
<212:- PR? 

-'jl3.- Artificial Sequence 
<220> 

<222.- Description of Artificial Sequence: stabilized 
peptide 

<4 0C> 8 0 

Met Met Arg Val Ala Pro Pro 
1 5 




■:2i0- 8 1 
Oil • 81 
-:212 ■ DUA 

•''2 1.2 Artificial Sequence 
Ol'C * 

-.222 - Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<A00 > e: 

agatctatqc cgccgttgeg cggqgeatgc gatgtatatg gggtaaattg aatctcttgt 60 
g g go ecc eg t aataagaatt c 81 



WO 00/22112 



PCTTS99/2373I 



•:2ii^ 14 

■:iI2 - FF.T 

-.212 - Artificial i-~ :e 

•;22C > 

H .: • Description ci Artificial Sequence: staliiinec 
pept i ce 

:~ 31 • £■ 2 

Met Pro Dro Leu Arc Gly Ala Cys Asp Val Tyi GV y V a 1 A sr. 

: l ig 



■:2 11 - 6 1 

12 - ON A 
•.:213- Artificial Sequence 



<22'3 > Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

< 4 0 0 - 6 3 

agatttatgc cgccggggag aggggaagcg gtgggagtga catgcttga:: cgcqaacgtq 60 
tacc^gccgr aataagaatt c 81 

<2±0* 6 4 

<2il> 21 

ell 2 * PRT 

<113 • Artificial Sequence 
^220 * 

3 • Description of Artificial Sequence: stabilized 
p-ept i de 

<4 0C • ? 4 

Met Pre Pro Gly Arc Gly Giu Ala Val Gly Val Tnr Gys Leu Ser Ala 
1 b 10 15 

As:. Val Tyr Pre Pro 



- 211 • 8 5 

•'211 • 61 

-.212 • DtJA 

-.22 3 • Artificial Sequence 



WO 00/22112 




PCT/US99/23731 



■ . i r .': . ucijnpt::r. of Artificial Seqirr.:^ : r.i:le:: acid 
or. -;;,;1ir. J stabilized peptide 

<:4o:d- c c 

agitrtat^:: :j;cgcgaaq ggtagcqcte tttgccgeca ictttgccte cg:'s;-:iu:: €2 
ct cccg cc gt aacaagaact e SI 

«_iCc- 66 

1 I r- 21 

-212:- PRT 

-_13* Artificial Sequence 



<22 3> Description cf Artificial Sequence: stabilized 
pep: ide 

* •]• ••; • e t. 

Met Iro ?tj Gly Arc Val Va 1 Phe Phe Val Ala lie Phe Vai Ser Ala 
1 f 10 15 

lie Cys Leu Pro Pro 

20 



<21C> £7 
<2 1 1 ^ £i 
<2i2> DNA 

<213> Artificial Sequence 
<2 20 ^ 

<222' Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

■:<]()'"' > i 7 

agatctatgc ccjccgag gtt cgctcatgag aqtgttaaag ggctggggga cgttacaaaa 60 
g z c. c g c c g t a a t a a g a a 1 1 c SI 

• 110.> 8 6 
-.ill > 21 
■ .212 > PPT 

-:213> Artificial Sequence 
•:220 - 

•'.2 22 > Description of Artificial Sequence: stabilized 
per t i ce 



WO 00/22112 




PCT/US99/23731 



!■:■:: i V-rz Pre Arg Fbe Ala Hi, 



2:.r leys Ala Pro Fr; 



Vai 
1 0 



<2 10 > 8 9 
-2 11* 72 
:2 12 - DF'A 

-:213 ■ Artificial Sequence 
•:220 - 

2^': ■ >ic crip t ion or Artificial Sequence: i.ujlei: 
encoding stabilized peptide 

• -;u : - 89 

.v:j/i:.:::atgc atgacgaaca agaagaggag cac aataaaa aggacaacgi; aaaaqaacac GO 
ti<i:aagaat t: 7 2 

•:210-> 90 

<211> 18 

•■212;- PRT 

■'212 ' Artificial Sequence 



<22? Description of Artificial Sequence: stabilized 
p>ept ide 

— 100> 90 

Met His Asp Glu Gin Glu Glu Glu His Asn Lys Lys Asp As:: Glu 'Lys 
15 10 15 

Glu Hi£ 



'210.- 91 
2 11- 7 5 
2 12-> DMA 



22C-: 



Artificial Sequence 



Description of Artificial Sequ e nee: n uclei: a c l ; 
encoding stabilized peptide 



■■4 02 



91 



WO 00/22112 
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a g t ? * .1 i :; ~ c g a g c a a g g s a g a a t g r : e a :: h a r r. a craita^* v: 



: 2 1 0 > 9 2 

: 2 1 1 > 2 2 

:: i: • pr-.-i 

212 * Artificial Sequence 



-2.:?' Description of Artificial Sequence: stabilized 
pept toe 

v i ; - 92 

Net Gir. Gin Glu His Glu Gin Gly Arg Met Ser Lys Arg Met Lys Asn 

: t :c 

Asn Lys Asn Ser His Val 



•'CIO/- 93 

'.'111:- 75 

<TJ12> DNA 

*;213> Artificial Sequence 



< 2 2 0 > 

<223> Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<A00> 9 3 

aqatctatga accatcataa tgaggccatg atcaacacaa tgaaaacgag gaataataag 60 
aattctcatg tttgs - 75 



<210> 94 

<:2il:- 22 

<212> PRT 

-:2: 3> Arti f icial 



Sequence 



v 2 2 3 : * Description of Artificial Sequence : stabilized 
peptide 

<4Cic- 94 

Net Asn His His Asn Glu Ala Met lie As:. TLr Net Lys Thr Arg Asn 
1 : M 15 



Asn Lys Asn 2- r Hi 5 Vai 



WO 00/22112 




PCT/US99/23731 



:2i2 ■ d::a 

.213- Artificial Sequence 



*:22 2 ■ Description c: Artificial Sequence: nucleic acid 
encoding stabilised peptiae 

• 4C : - 92 

agatetatqa aegacgacaa tcaqeaagag gataatcatg atcaocataa gcataaeaaa 60 
laiitaagaattc 72 

e210 • 96 

■ IS 

-:.:12 ■ PRT 

■213 ■ Artificial Sequence 
v220 .• 

vS23~- Description of Artificial Sequence: stabilized 
peptide 

v4C0-- 96 

Met Asn Asp Asp Asn Gin Gin Glu Asp Asn His Asp Gin His Lys Asp 
15 10 15 

Asm Lys 



<:210 • 97 
-SI 1 - 72 
* 212. DNA 

<- 2 I 3 > Artificial Sequence 
< ^20 • 

< 223> Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

'4 0 0 . > 9 7 

agatctatgs aagageagga tcagcata-t gataaccatc acgaggataa acataagaag 6 0 
t a a t a a c a a t t e 7 2 

<210> 93 
<.2U> 13 
<212> PF 7 

2 7 



WO 00/22112 



PCT/US99/23731 



A r t i f i c 1 a 1 : equ^r.r- 



les c ripti: 
pect ide 



Art if 



c — ■ - r , , 



■■M0 0> 9B 

Met Gin GIu Gin Asp Gin His Asn Asp Asr. His H i s CLu Asc, 
: : 1 0 



His 



,ys 



• : 1 1 0 > Q 9 
9 3 

•:212> DNA 

3> Artificial Sequence 

<2 2 3> Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<400> 99 

agatctatgg aagacgaaga cgagggtgcg tcagcgtggg gagcagaact ttggtcgtgg 60 
cagtcgqtgc gtaaacgtaa ataataagaa ttc 93 

<210> 100 

<1U> 25 

■■'212 > PRT 

<213> Artificial Sequence 
-:220> 

*:22 3 > Description of Artificial Sequence: stabilize a 
pept ide 

v^OCu- 100 

Met Glu Asp Glu Asp Giu Gly Ala Ser Ala Trp Gly Ala Glu Leu Trp 
15 10 lb 

Ser Trp Gin Ser Vai Arc Lys Arg Lys 
2 0 2o 



<:310> 101 

<211> 93 

<^212:> DNA 

<213> Artifi:.;! Secuense 



WO 00/22112 PCT/V S99/23"'3I 



- 2. / 2 • Des:r;p:icr. cf Ar;:r:cial Seqje:.:e; riu:ie:c acid 
er.cC'din^ stabilized peptide 

v-Cu • ic : 

agatctacgg aagacgaaga cgctc:aggc atcgggggcg ggttggtcag g:;cact::a 
rut:ctc:c gtaaacqtaa ataataaqaa :t: 9; 

■VIC- 102 

Oil • 25 

-'222 • FRT 

-'213- Artificial Sequence 



*:2 2V- De script! or. cf Artificial Sequence: stabilized 
pept ice 

•MOO - 102 

Met Glu Asp Glu Asp Gly Leu Giy Met Giy Gly Gly Leu Val Arc Leu 
J 5 10 15 

Thr Leu Leu Phe Phe Arg Lys Arg Lys 
2 0 2 5 



<210> 103 
<211> 93 
<212> DNA 

<2I2-> Artificial Sequence 
'22G> 

<222> les:ription cf Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<4 00> 10 3 

agatctatgg aagacgaaga cggggagagg atccaggggg cccgctgtcc agtaccgctg 60 
gtagatagacctaaacgtaaataataagaattc 9 3 

-■210> 104 

<-2Zl> 2 5 

- 212 > FRT 

'-21 5 : * Art i f i c i a 1 Sequence 



22 0: • 

22 5 > E'e script ion of Artificial Sequence: stabilized 
peptide 



WO 00/22112 




PCT/LS99/2373I 



Met; Gj u Asp G 1 u A 5 r 



— 1 G_ u A: q ile G:n oly A_a A:;: 



Ala Leu Yal Asp Arc Arq Lys Arq Lys 



<-2K > 105 

< .11 > 11 
^112 > PFT 

* Artificial Sequence 

< 220 - 

<:2?- Description of Artificial Sequence: s f abi ] i :>:■ ci 
pept ide 

<--3 00 - 105 

Met 'Au Asp Glu Asp Asp Arc Gly Arq Giy Arc 



<210- 106 
<211 > 93 
Gi2 > DNA 

<213- Artificial Sequence 
<220 -> 

<223> Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 

<400> 106 

aqatctatgq aagacgaaga cgacaggggg cgtgggcggt agecttaaot tgcgctaagt 60 
tgqgagatac gtaaacgtaa ataataagaa ttc 93 

<210.> 107 

Oil- 93 

<2. 12 ■• DNA 

<:213- Artificial Sequence 
<12 0 

< 123 - Description of Artificial Sequence: nucleic acic 



< 4 0 0 : ■ 1 0 7 

agatctatgg aagacgaaga egggggggee gggaggaggg cctgtctttg ttecgegett 60 
gttggggaac gtaaacgtaa ataataagaa ttc 93 



encoding stabilized peptide 



WO 00/22112 



::: > pr? 

ill * ».r:ifi:.i 



a i sequence 



PCT/US99/23731 



•*222 > Description of Artificial Sequer.ce: stabilized 
pep t ice 

•■ -IOC > 106 

Met Glu Asp Glu Asp Giy Giy Ala Gly Arc Arg Ala Cys Leu Cys Ser 
Ala Leu Val Giy Glu Arg Lys Arg Lys 



• 110 ■ 109 

•'21 1 • 90 

- 212 - DNA 

**213* Artificial Sequence 
<220-> 

<223 > Description of Artificial Sequence: nucleic acid 
encoding stabilized peptide 



•■MOO • 10 9 

agatctatgg aagacgaaga caagcgtcgc gagaggagtg caaaagggcg tcatgtcggt 60 
cggtcgatgc gtaaacgcaa ataagactgt 90 



<210 • 11C 

•;211"- 2 5 

-^212> PRT 

•21?>:- Artificial Sequence 



■-223:- Description cf Artificial Sequence: stabilized 
peptide 

<M00> 110 

Met Glu Asp Glu Asp Lys Arg Arg Glu Arg Ser Ala Lys Gly A.rg His 
1 5 10 15 



Val Giy Arc Ser Met A.rg Lys Arg Lys 

2 0 2 5 
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